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METHOD AND SYSTEM FOR IMPLEMENTING 
WEIGHTED VECTOR ERROR ECHO 
CANCELLERS 

FIELD OF THE INVENTION 

[0001] The present invention relates generally to echo 
cancellers and, more particularly, to weighted vector error 
echo cancellers for transceivers with unequal bandwidths. 

BACKGROUND OF THE INVENTION 

[0002] The advent of the Internet and the widespread 
popularity of personal computers have created an unprec- 
edented demand for high bandwidth networks. Generally, 
Internet applications, from simple email to real time video 
conferencing, from web surfing to interactive movies, from 
interactive games to virtual TV stations, from online trading 
to online gambling, demand a higher bandwidth communi- 
cation network. A fundamental challenge for the communi- 
cation industry is to provide a reliable and affordable high 
bandwidth communication link to all types of Internet users. 
Various competing wire-line, wireless, and optical broad- 
band technologies are deployed to partially meet the ever- 
increasing demand for higher bandwidth. The fastest grow- 
ing broadband technology is the Digital Subscriber line 
(DSL) technology, which provides a high bandwidth 
always-on connection over standard twisted pair copper 
media of the conventional telephone network. Among other 
wire-line media, coaxial cables are capable of providing 
always-on connections, however, its presence is insignifi- 
cant compared to millions and millions of wired telephone 
customers who are connected by a twisted pair of copper 
wires. Other technologies, such as satellite, wireless, and 
optical, either provide limited coverage, limited bandwidth, 
or are too expensive for deployment to individual customers. 
As a result, DSL technology is uniquely positioned to 
provide the broadband link between individual customer 
premise and the central office, the so-called last-mile of the 
high-bandwidth communication network. 

[0003] DSL is the fastest growing among emerging broad- 
band technologies for very good reasons. First of all, DSL 
utilizes the existing copper wire network infrastructure. 
Secondly, compared to the voice modems, such as V34 and 
V90, used in most personal computers that provide up to 56 
kbps dial-up connection, DSL provides a high bandwidth 
always-on connection with typical connection speeds from 
384 kbps to 6 Mbps. Moreover, DSL is affordable with easy 
installation, simple turn-up, and high service reliability. The 
successful deployment of DSL is capable of providing 
digital broadband connection to anyone with an analog 
telephone line. 

[0004] DSL services have been standardized over time by 
regional organizations such as, American National Standard 
Institute (ANSI), European Telecommunication Standard 
Institute (ETSI), and by world telecommunication organi- 
zation International Telecommunication Union (ITU). These 
DSL standards define data communication protocols to 
connect customer premise equipment (CPE) to the central 
office (CO) and to provide connections to various networks, 
such as DSL service providers, virtual private networks 
(VPN), or the Internet. Various forms of digital data (e.g., 
voice, video, and data) can be transported using DSL tech- 
nology. For transport of voice, DSL equipment is connected 



to the public switched telephone network (PSTN). For 
transport of video and data, DSL equipment uses the Internet 
via an Internet service provider (ISP). Voice over DSL 
(VoDSL) is capable of providing computer-to-computer, 
computer-to-telephone, and telephone-to-telephone voice 
services using an integrated access device (IAD). Video over 
DSL includes transport of MPEG-1 or MPEG-2 files, video 
conferencing using Internet Protocol (IP) standard such as 
ITU H323, WebCam, and video mail. In addition, DSL 
supports simple data transport, e.g., bearer services, for 
virtual private network (VPN), leased data line such as Tl 
and El, Point-to-Point Protocol (PPP), Asynchronous trans- 
fer mode (ATM), and Internet Protocol (IP). 

[0005] Like other communication technologies, DSL has 
gone though a major evolution over the last decade and a 
collection of technologies, commonly referred to as xDSL, 
are developed under the umbrella of DSL One type of 
subscriber loop digital transmission technology involves an 
integrated services digital network (ISDN), which has 
replaced a significant portion of the analog phone lines in 
Europe and Japan. ISDN offers integrated voice and data 
services and connection speed up to 144 kbps. Due to the 
high cost of deployment, an alternative solution called 
integrated digital loop carrier (IDLQ was deployed in 
United States. However, resulting data rates were considered 
inadequate for individual customers. As a result, advanced 
DSL technologies were developed including HDSL, SDSL, 
ADSL, HDSL2, SHDSL, and VDSL, all of which are 
capable of connection speed in excess of 1 Mbps. These 
advanced DSL technologies were developed to address 
different needs and application demands, while serving 
different market segments. For example, SHDSL is a sym- 
metric service designed for long reach office applications 
with connection speed of 1.5 Mbps, whereas, VDSL is 
designed to provide a very high-speed asymmetric service 
for a short-range applications. 

[0006] An achievement of DSL technologies is the use of 
fewer wires for transmission. For example, compared to 
high speed subscriber line (HDSL) technology, developed in 
the late '80s that requires a minimum of 4 copper wires, 
HDSL2, ADSL, and SHDSL all use 2 copper wires. Trans- 
mitting and receiving signals at the same time over the same 
wire-pair achieves this reduction in the number of copper 
wires. This technique is known as full-duplex transmission. 
Conventional wire line communication systems use an ana- 
log four-to-two wire conversion circuit, called a hybrid 
circuit, to achieve full duplex transmission over a single wire 
pair. In this configuration, transmit and receive paths share 
the same wire pair through the hybrid circuit As a result, the 
transmit signal leaks into the receive path whenever perfect 
balance is not achieved in the hybrid circuit. The part of the 
transmit signal that leaks into the receive path is referred to 
as an echo signal. 

[0007] Generally, echo is created when a transmitted sig- 
nal is reflected back through a hybrid circuit, which includes 
a four-to-two wire interface that comprises a transformer 
and a balanced load to match the average impedance of the 
various lines, into the near-end receiver. A typical passive 
analog hybrid generally provides less than 20 decibels (dB) 
of average return loss from the transmitter to the near-end 
receiver, while more than 70 dB is required to achieve high 
rate data transmission applications. The echo signal, if not 
adequately suppressed, can severely degrade the perfor- 
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mance of the full-duplex digital transmission system. This is 
especially true for medium and long loop lengths where the 
received signal strength can be lower than that of the echo 
signal due to the line loss. Unfortunately, in any real 
application the line characteristics are variable and 
unknown. Thus, it is difficult to exactly match the line 
conditions by a passive analog hybrid circuit. Under these 
conditions, a traditional analog hybrid circuit provides an 
insufficient level of echo suppression. Echo cancellation 
generally involves separating two signals traveling simulta- 
neously in opposite directions in the same wire pair over the 
same frequency bands. Estimating and canceling the echo 
produced by the leakage of the transmitted signal into the 
receiver may achieve separation. Echo cancellation may be 
used to increase the reach or noise margin of an ADSL 
system by allowing both upstream and downstream channels 
to use the low frequency portion of twisted pair spectrum. 

[0008] Digital echo cancellers are commonly used to 
suppress the echo to an adequate level and to recover the 
performance loss due to the echo. Hence, digital echo 
cancellers are an integral part of high performance full- 
duplex systems. In wire line communication systems with 
full-duplex transmission over a single wire pair, digital echo 
cancellers are commonly used to suppress the echo of the 
transmit signal entering the receive path. In an echo-can- 
celled system, the transmit signal may be filtered using an 
echo cancellation filter (ECF) to generate a reliable copy of 
the echo signal before subtracting it from the received 
signal. Implementation of digital echo cancellers is well 
understood for applications such as symmetric DSL (e.g., 
High-Speed Digital Subscriber Line HDSL2 or Cshdsl) 
with identical transmit and receive rates. However, for 
applications such as ADSL and Very High Bit Rate Digital 
Subscriber line (VDSL) with asymmetric data rates, the 
sample rates of transmit and receive paths need to be 
adequately bridged or matched. Though such rate matching 
function is crucial for efficient implementation of the echo 
cancellers, currently there is no effective approach to 
achieve rate matching. 

[0009] For example, a full rate ADSL system, such as that 
specified by the International Telecommunications Union 
(ITU) Standard G .992.1, may support full duplex transmis- 
sion over a single wire pair. To achieve such full duplex 
transmission, digital echo cancellers may be implemented 
for ADSL to suppress the echo of the transmit signal 
entering the receive path. However, ADSL is generally 
designed to support asymmetric data rates, e.g., transmit and 
receive rates are not identical for an ADSL transceiver. In 
particular, at a Customer Premise Equipment (CPE), the 
receive data rate is generally higher than the transmit data 
rate. Moreover, G.992.1 specifies different transmit band- 
widths for the CPE according to the region specific Annexes. 
For example, the Annex A is designed for North American 
deployment, Annex B is for European countries and annex 
C is for Japan. Thus, echo cancellers may be required to 
bridge such disparity between sample rates of transmit and 
receive paths. Though such rate matching function is crucial 
for efficient implementation of echo cancellers, there is no 
obvious approach to achieve rate matching. 

[0010] A much longer adaptive echo canceller filter oper- 
ating at the higher receive sample rate may be used to 
achieve satisfactory results at the cost of increased total 
computation, power, and silicon area. Furthermore, with 



longer filters, the training algorithm is more prone to 
diverge. Another alternative involves using frequency-do- 
main echo cancellation schemes, which requires a more 
complex hardware/software design and is not as flexible as 
the time-domain filter approach, in terms of steady state 
tracking, for example. 

[00U] The use of a time-domain adaptive echo cancella- 
tion filter operating at the higher sample rate of the transmit 
and receive paths is standard practice in communication 
system design. For Discrete multi-tone (DMT) systems, 
alternative echo cancellation schemes may include data- 
driven multi-tone echo cancellers, frequency domain echo 
cancellation, and discrete multi-tone echo cancellation. Tra- 
ditional echo cancellation schemes generally use a combi- 
nation of time and frequency-domain operations to reduce 
the computational complexity associated with the training 
and operation of a conventional echo canceller. Furthermore, 
these traditional echo cancellation schemes generally 
require extra design in hardware and/or software to access 
frequency-domain information. These frequency-domain 
based designs also have potential problems in terms of 
steady-state tracking. A significantly larger echo cancella- 
tion filter (ECF) operating at a higher rate is generally 
required. Moreover, the conventional approach requires a 
higher precision arithmetic. All of which require more 
hardware resources and computational power. 

[0012] Therefore, there is a need in the art of echo 
cancellers for a more efficient method and system for 
providing echo cancellation in dual rate applications and 
near optimal and low complexity adaptive implementations. 

SUMMARY OF THE INVENTION 

[0013] Aspects of the present invention overcome the 
problems noted above, and realize additional advantages. 
One such inventive aspect provides methods and systems for 
implementing weighted vector error echo cancellers for 
transceivers with unequal bandwidths. One aspect of this 
invention relates to an efficient echo cancellation scheme, 
referred to as Weighted Vector Error Echo Canceller 
(WEVE-EC) for applications where the receive path has a 
higher sampling rate. The WEVE-EC architecture may be 
implemented along with an adaptive algorithm, which may 
be based on Least Mean Square (LMS) update rules. Various 
other adaptive algorithms may be used to train the WEVE- 
EC of the present invention. An Error Weighting Multi- 
input-multi-output Filter (EWMF) of the present invention 
provides a flexible weighting scheme on most or all the 
sampling phases of the error signal. 

[0014] According to an embodiment of the present inven- 
tion, a weighted vector error echo canceller comprises an 
adaptive echo canceller filter having an input adapted to 
receive a transmit signal, the adaptive echo canceller filter 
being adapted to generate a signal representative of an echo 
signal associated with the transmit signal, wherein the 
adaptive echo canceller filter functions at a transmit rate; and 
an interpolation filter bank having an input adapted to 
receive the representative signal, the interpolation filter 
being adapted to generate a filtered output vector at a 
plurality of branches, wherein the filtered output vector is 
subtracted from a receive signal vector for generating a 
residual echo vector and wherein the plurality of branches 
corresponds to the receive signal vector. 
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[0015] Other aspects of the present invention may include 
an error weighting multi-input-multi-output filter having an 
input adapted to receive the residual echo vector, the error 
weighting multi-input-multi-output filter being adapted to 
generate a weighted error vector for training the adaptive 
echo canceller filter; wherein the adaptive echo canceller 
filter is a finite impulse response filter; a reference signal 
interpolation filter having an input adapted to receive a 
transmit signal, the reference signal interpolation filter being 
adapted to generate a reference signal vector; a vectorization 
unit having an input adapted to receive an input signal from 
an analog front end, the vectorization unit being adapted to 
generate a receive signal vector; at least one delay unit for 
delaying the input signal by a predetermined number of 
samples; at least one down sampling block for decimating 
the input signal by a predetermined factor; wherein training 
functionality associated with the adaptive echo canceller 
filter functions at the transmit rate; wherein updating func- 
tionality associated with the adaptive echo canceller filter 
functions at the transmit rate; and wherein the training 
functionality involves adapting a plurality of coefficients of 
the adaptive echo canceller filter based on at least one of a 
plurality of up sampled reference sequences and weighted 
error signal. 

[0016] According to an embodiment of the present inven- 
tion, a method for implementing a weighted vector error 
echo canceller comprises the steps of receiving a transmit 
signal; generating a signal representative of an echo signal 
associated with the transmit signal at a transmit rate; receiv- 
ing the representative signal; and generating a filtered output 
vector at a plurality of branches, wherein the filtered output 
vector is subtracted from a receive signal vector for gener- 
ating a residual echo vector and wherein the plurality of 
branches corresponds to the receive signal vector. 

[0017] Other aspects of the present invention may include 
the steps of receiving the residual echo vector and generat- 
ing a weighted error vector for training an adaptive echo 
canceller filter; wherein the adaptive echo canceller filter is 
a finite impulse response filter; the step of generating a 
reference signal vector; the steps of receiving an input signal 
from an analog front end; and generating a receive signal 
vector; the step of delaying the input signal by a predeter- 
mined number of samples; the step of decimating the input 
signal by a predetermined factor; wherein training function- 
ality associated with the adaptive echo canceller filter func- 
tions at the transmit rate; wherein updating functionality 
associated with the adaptive echo canceller filter functions at 
the transmit rate; and the step of adapting a plurality of 
coefficients of the adaptive echo canceller filter based on at 
least one of a plurality of up sampled reference sequences 
and weighted error signal. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0018] The present invention can be understood more 
completely by reading the following Detailed Description of 
the Invention, in conjunction with the accompanying draw- 
ings, in which: 

[0019] FIG. la is a block diagram illustrating a digital 
echo canceller, according to an embodiment of the present 
invention. 

[0020] FIG. 16 is a system diagram illustrating a digital 
echo canceller implementation, according to an embodiment 
of the present invention. 



[0021] FIG. lc is a system diagram illustrating a TEQ 
implementation, according to an embodiment of the present 
invention. 

[0022] FIG. Id is a system diagram illustrating modem 
transmission, according to an embodiment of the present 
invention. 

[0023] FIG. 2 is a block diagram illustrating an echo 
canceller, according to an embodiment of the present inven- 
tion. 

[0024] FIG. 3 is a block diagram illustrating an example 
of a dual rate echo canceller, according to an embodiment of 
the present invention. 

[0025] FIG. 4 is a block diagram illustrating an example 
of a dual rate echo canceller, according to an embodiment of 
the present invention. 

[0026] FIG. 5 is a block diagram of a CPE side dual rate 
echo canceller for simultaneous support of multiple annexes, 
according to an embodiment of the present invention. 

[0027] FIG. 6 is an example of an impulse response of IF 
and AAF, according to an embodiment of the present inven- 
tion. 

[0028] FIG. 7 is a convolution of IF and AAF impulse 
response, according to an embodiment of the present inven- 
tion. 

[0029] FIG. 8 is a block diagram illustrating an example 
of a weighted vector error echo canceller, according to an 
embodiment of the present invention. 

[0030] FIG. 9 is a flowchart illustrating an update process, 
according to an embodiment of the present invention. 

[0031] FIG. 10a is a block diagram of a TEQ system, 
according to an embodiment of the present invention illus- 
trating a fractionally spaced TEQ. 

[0032] FIG. 10b is a block diagram of a TEQ system, 
according to an embodiment of the present invention illus- 
trating a sample spaced TEQ. 

[0033] FIG. 10c is a block diagram of a TEQ system, 
according to an embodiment of the present invention illus- 
trating a hypothetical model of a delay sample transmit 
signal for determining TEQ filter coefficients. 

[0034] FIG. U presents a flow chart of the method in 
accordance with the current invention. 

[0035] FIG. 12 presents a flow chart describing a method 
of calculating the TEQ filter vector of the present invention. 

[0036] FIG. 13 presents a flow chart of the steps of 
calculating T1R ({bj) and TEQ ({wj) filter coefficients 
using MLC-TEQ. 

[0037] FIGS. 14a and 146 are schematic diagrams of 
hardware architectures in which the inventive aspects of the 
present invention may be incorporated. 

[0038] FIG. 15 is a block diagram of a physical media 
dependent layer of a ADSL CPE chip in which the inventive 
aspects of the present invention may be incorporated. 

[0039] FIG. 16 is a block diagram of a transmission 
convergence layer of a ADSL CPE chip in which the 
inventive aspects of the present invention may be incorpo- 
rated. 
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[0040] FIG. 17 is a block diagram of an analog front end 
device in which the inventive aspects of the present inven- 
tion may be incorporated. 

[0041] FIGS. ISa-lHc are block diagrams of applications 
in which the inventive aspects of the present invention may 
be incorporated. 

LIST OF ACRONYMS 

[0042] A/D — Analog To Digital 

[0043] AAF — Anti-Aliasing Filter 

[0044] ADC— Analog to Digital Converter 

[0045] ADSL — Asymmetric Digital Subscriber Line 

[0046] AECF — Adaptive Echo Cancellation Filter 

[0047] AFE— Analog Front End 

[0048] AGC— Automatic Gain Control 

[0049] AOC— ADSL Overhead Control 

[0050] ANSI— American National Standard Institute 

[0051] ARM— Advanced RISC Machine 

[0052] ASIC — Application-Specific Integrated Cir- 
cuit 

[0053] ATM — Asynchronous Transfer Mode 

[0054] BIU— Bus Interface Unit 

[0055] CIR — Channel Impulse Response 

[0056] CLECs — Competitive Local Exchange Carri- 
ers 

[0057] CLK — Clock 

[0058] CO— Central Office 

[0059] CPE — Customer Premise Equipment 

[0060] CRC— Cyclic Redundancy Check 

[0061] CRI^-Clock Recovery Loop 

[0062] CSR— Control and Status Registers 

[0063] DAC— Digital to Analog Converter 

[0064] D/A — Digital To Analog 

[0065] dB^Decibels 

[0066] DMA — Direct Memory Access/Addressing 

[0067] DMT— Discrete Multi-Tone 

[0068] DP — DMT Processor 

[0069] DSB — Down Sampling Block 

[0070] DSL — Digital Subscriber Line 

[0071] DSLAM— DSL Access Multiplexor/Module 

[0072] DSP — Digital Signal Processing 

[0073] EEPROM — Electrically Erasable Program- 
mable Read Only Memory 

[0074] EC — Echo Canceller 

[0075] ECF— Echo Cancellation Filter 

[0076] EOC— imbedded Overhead Control 



[0077] EPB— External Peripheral Bus 

[0078] Eth PHY— Ethernet Physical Layer 

[0079] ETSI — European Telecommunication Stan- 
dard Institute 

[0080] EWMP— Error Weighting Multi-input-multi- 
output Filter 

[0081] FDD — Frequency Division Duplex 

[0082] FEC — Forward Error Correction 

[0083] FEQ — Frequency Domain Equalization 

[0084] FIFO— First In First Out 

[0085] FIR— Finite Impulse Response 

[0086] GPIO— General Purpose Input/Output 

[0087] HDLC-^iigb-Level Data Link Control 

[0088] HDSL— High-Speed Digital Subscriber Line 

[0089] HIU— Host Interface Unit 

[0090] IAD— Integrated Access Device 

[0091] ICE — Information and Content Exchange 

[0092] ICE— In-Circuit Emulator 

[0093] ICI — Inter Channel or Carrier Interference 

[0094] IDLC— Integrated Digital Loop Carrier 

[0095] IF- Interpolation Filter 

[0096] I/F — Interface 

[0097] IFB— Interpolation Filter Bank 

[0098] tFFT — Inverse Fast Fourier Transform 

[0099] ILECs — Incumbent Local Exchange Carriers 

[0100] IP — Internet Protocol 

[0101] IPF— Interpolation Filter 

[0102] ISDN— Integrated Services Digital Network 

[0103] ISI— Inter Symbol Interference 

[0104] ISOS— Integrated Software on Silicon 

[0105] ISP-^ntemet Service Provider 

[0106] ITU — International Telecommunications 
Union 

[0107] IXCs — Interexchange Carriers 

[0108] JEDEC— Joint Electronic Device Engineer- 
ing Counsel 

[0109] HAG— Joint Test Action Group 

[0110] LD — Line Driver 

[0111] LMS — Least Mean Square 

[0112] LPF — Low Pass Filter 

[0113] LTI — Linear Time-Invariant 

[0114] MAC— Media Access Control 

[0115] MCM — Multi Chip Module 

[0116] Mil— Media Independent Interface 
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[0117] MIPS— Million Instructions Per Second 

[0118] MIMO— Multi-Input Multi-Output 

[0119] MMSE — Minimum Mean Squared Error 

[0120] MSE— Mean Squared Error 

[0121] NP— Network Processor 

[0122] Km— Network Timing Reference 

[0123] OAM — Operation, Administration and Main- 
tenance 

[0124] OSI — Open Source Initiative 

[0125] PAC — Programmable Attenuation Control 

[0126] PCI — Peripheral Component Interconnect 

[0127] PGA — Programmable Gain Amplifier 

[0128] PHY — Physical Layer Device 

[0129] PMS— Physical Media Specific 

[0130] PP— Protocol Processor 

[0131] PPP— Point-to-Point Protocol 

[0132] PROM — Programmable Read Only Memory 

[0133] PSD— Power Spectral Density 

[0134] PSTN-^ublic Switched Telephone Network 

[0135] RCDR-EC— Reduced Complexity Dual Rate 
Echo Canceller 

[0136] RMB— Rate Matching Block 

[0137] RMB-RT — Rate Matching Block from 
Receive to Transmit 

[0138] RMB-TR— Rate Matching Block from Trans- 
mit to Receive 

[0139] ROM— Read Only Memory 

[0140] RS — Reed Solomon 

[0141] RSIF— Reference Signal Interpolation Filter 

[0142] Rx — Receive 

[0143] SD— Sigma-Deita 

[0144] SDRAM — Synchronous Dynamic Random 
Access Memory 

[0145] SNMP — Simple Network Management Pro- 
tocol 

[0146] SNR — Signal To Noise Ratio 

[0147] SRAM — Static Random Access Memory 

[0148] STM — Synchronous Transfer Mode 

[0149] SVD— Singular Value Decomposition 

[0150] TC — Transmission Convergence 

[0151] TEQ — Time Domain Equalizers 

[0152] HR— Target Impulse Response 

[0153] Tx— Transmit 

[0154] UART — Universal Asynchronous Receiver 
Transmitter 



[0155] USB— Up Sampling Block 

[0156] USB— Universal Serial Bus 

[0157] VCXO— Voltage Controlled Crystal Oscilla- 
tor 

[0158] VDSI^-Very High Bit Rate Digital Sub- 
scriber Line 

[0159] \bDSL— \bice over Digital Subscriber Line 

[0160] \foIP — Voice over Internet Protocol 

[0161] VPN— Virtual Private Network 

[0162] VU— Vectorizauon Unit 

[0163] WEVE-EC— Weighted Vector Error Echo 
Canceller 

DETAILED DESCRIPTION OF THE 
INVENTION 

[0164] The following description is intended to convey a 
thorough understanding of the invention by providing a 
number of specific embodiments and details involving echo 
cancellers. It is understood, however, that the invention is 
not limited to these specific embodiments and details, which 
are exemplary only. It is further understood that one pos- 
sessing ordinary skill in the art, in light of known systems 
and methods, would appreciate the use of the invention for 
its intended purposes and benefits in any number of alter- 
native embodiments, depending upon specific design and 
other needs. 

[0165] According to an embodiment of the present inven- 
tion, a dual rate echo canceller is provided for applications 
with asymmetric transmit and receive rates. In particular, the 
present invention provides Reduced Complexity Dual Rate 
Echo Canceller (RCDR-EC) architecture along with low 
complexity Least Mean Square (LMS) update rules. Accord- 
ing to one aspect, the present invention is directed to an echo 
canceller that provides rate matching functionality for appli- 
cations (e.g., ADSL, VDSL, etc.) with asymmetric data 
rates. 

[0166] The RCDR-EC architecture and LMS update rules 
of the present invention provide various features and advan- 
tages. For example, RCDR-EC may operate at the lower of 
the two rates, receive and transmit, requiring less computa- 
tion per data sample. RCDR-EC implements a significantly 
smaller length echo cancellation filter (ECF) for achieving 
the same (or similar) level of echo suppression of conven- 
tional implementations. This reduces the hardware require- 
ment for implementation of RCDR-EC. Due, in part, to the 
reduction in ECF length, RCDR-EC may involve signifi- 
cantly less computation for filtering operation in steady 
state. The reduced rate and smaller echo canceller length 
enables RCDR-EC training to be simplified where less 
computation is required for adaptive training of the RCDR- 
EC. In addition, inherent out-of-band noise rejection capa- 
bility of RCDR-EC enables adaptive training to require less 
time to converge. Moreover, out-of-band noise rejection 
enables the adaptive training algorithm to yield an improved 
RCDR-ECF resulting in enhanced echo suppression. 

[0167] The present invention provides an efficient rate 
matching echo canceller for applications with asymmetric 
transmit and receive rates, such as ADSL and VDSL, for 
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example. Another aspect of the present invention provides 
echo canceller structures for cases involving higher and 
lower receive rates as compared to the transmit rate. Another 
aspect of the present invention is directed to developing a 
low complexity LMS update for adaptive training of the 
echo cancellation filters. In addition, the present invention 
reduces hardware requirements for implementation of the 
echo canceller and reduces computational requirements for 
steady state operation of the echo canceller. As a result, 
overall performance of the echo canceller is improved. 

[0168] A basic block diagram of the digital echo canceller 
structure 100, which may be employed in central office 
(CO), CPE or other equipment, is shown in FIG. la. A 
transmit filter 110 receives an input transmit signal from a 
transmit path and generates a filtered transmit signal. The 
output of transmit filter 110 may be received by analog front 
end (AFE) 112, which is comprised of digital to analog 
converter and various analog components including line 
drivers, for example. Echo cancellation filter (ECF) 122 
receives an input from the output of the transmit filter and 
generates a copy of an echo signal by a linear filtering 
operation. The output generated by AFE 112 may be 
received at an input of Analog Hybrid 114, which is com- 
prised of various passive analog components, for example, 
and may be coupled to a twisted wire pair or other trans- 
mission line. A receive signal may be received by analog 
hybrid 114, which may be coupled to AFE 116 at a receive 
side. For example, analog hybrid 114 may include common 
analog components, such as resistors and capacitors. The 
output generated by AFE 116 is received at an input of 
receive filter 118. The receive filter performs linear filtering 
operation and attempts to suppress out of band noise. The 
copy of the echo signal generated by ECF 122 is subtracted 
from the receive signal generated by receive filter 118, at 
summer 120. The resulting residua] signal may be used to 
train echo cancellation filter 122, such as by feeding a 
sampling signal 121 into echo cancellation filter 122. 

[0169] As shown in FIG. la, digital echo cancellation 
may be achieved by obtaining a reliable copy of the echo 
signal, which may be generated from the transmit signal 
using a filter, such as a Finite Impulse Response (FIR) ECF, 
as illustrated by 122. The copy of the echo signal is then 
subtracted from the received signal. The FIR ECF 122 
operates as a connecting branch between the transmit and 
receive paths. For example, the echo canceller filter receives 
an input from the transmit path and generates an output for 
the receive path. Hence, the ECF 122 accommodates the rate 
transition from the input transmit rate to the output receive 
rate. For applications, such as symmetric DSL (e.g., HDSL2, 
Cshdsl, etc.), where the transmit and receive data rates are 
identical there is generally little or no need for rate transi- 
tion. As a result, implementation of ECF is straightforward 
in these cases. However, for applications such as ADSL and 
VDSL and for other applications that support asymmetric 
transmit and receive data rates, rate transition may be 
required at the ECF. As addressed by the present invention, 
there are different approaches to achieve the rate transition 
within the ECF. In accordance with an embodiment of the 
present invention, ECF 122 may incorporate the inventive 
features as discussed in detail below. 

[0170] FIG. lb is a system diagram illustrating a digital 
echo canceller implementation, according to an embodiment 
of the present invention. In particular, system 130 illustrates 



echo cancellation functionality in a DSL modem system. CO 
side DSL modem is represented by 132, which may include 
a DSL modem 134 transmitting to and receiving from 
Hybrid and Analog Front End 138. Echo cancellation filter 
136 compensates for a portion of the transmit signal that is 
received as an echo signal as shown by arrow 137. CO side 
DSL modem 132 may be connected to CPE DSL modem 
142 by a twisted copper pair 140. CPE DSL modem 142 may 
include a DSL modem 144 transmitting to and receiving 
from Hybrid and Analog Front End 148. Echo cancellation 
filter 146 compensates for a portion of the transmit signal 
that is received as an echo signal as shown by arrow 147. 
The inventive aspects of the echo cancellation filter as 
described in detail below may be incorporated into echo 
cancellation filter represented by 136 and 146. 

[0171] FIG. 1c is a system diagram illustrating a TEQ 
implementation, according to an embodiment of the present 
invention. In particular, system 150 represents TEQ in a 
DMT based DSL modem. CO Side DSL Modem 152 may 
include DMT transmitter 154, Transmit Filter 156, Analog 
Front End 158, Receive filter 160, TEQ 162 and DMT 
receiver 164. CO Side DSL Modem 152 is coupled to CPE 
DSL modem 168 via twisted copper pair 166. CPE DSL 
modem 168 may include DMT transmitter 170, Transmit 
Filter 172, Analog Front End 174, Receive filter 176, TEQ 
178 and DMT receiver 179. The inventive aspects of the 
TEQ features as described in detail below may be incorpo- 
rated into TEQ represented by 162 and 178. 

[0172] FIG. Id is a system diagram illustrating modem 
transmission, according to an embodiment of the present 
invention. At the CO end, various sources of data may be 
transmitted via a DSL modem, as represented by 182, which 
may further include DSL Access Module, DSLAM, for 
example. Internet 184 may provide data, such as video and 
other forms of data, via Internet Service Provider 183. ATM 
Network 186 may provide data, such as video and other 
forms of data, via ATM Switch 185. Public Switch Network 
(PSTN) 188 may provide voice data to Central Office Switch 
187. DSL modem 182 may receive the various forms of data 
and communicate via a twisted copper pair 181 at a CPE 
device. At the CPE end, data from DSL modem 182 may be 
received by DSL modem CPE 192. Vbice data may be 
received by splitter 190 for transmission to a telephone 191 
or other voice device. DSL modem CPE 192 may transmit 
data to a computer 193 or other device. Video data may be 
received by set top box 194 for transmission to television 
(TV) 195 or other video device. The inventive aspects of the 
echo cancellation features as well as the TEQ features may 
be incorporated in this system, in accordance with the 
various embodiments of the present invention as discussed 
in detail below. 

[0173] Generally, the receive data rate is a rational mul- 
tiple of the transmit data rate, as illustrated by 

[0174] where a«P/Q for co-prime integers P and Q (e.g., 
P and Q do not have any common factors). R^, and 
represent the sampling rates at the output of the transmit and 
the receive filters, respectively. In one example, col may 
represent cases, such as ADSL CPE, where the receive rate 
is higher than the transmit rate. On the other hand, a<l may 
represent cases, such as ADSL CO, where the receive rate is 
lower than the transmit rate. Depending on the value of a, 
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different Reduced Complexity Dual Rate Echo Canceller 
(RCDR-EQ structures may be implemented. 

[0175] FIG- 2 is a block diagram of an echo canceller, 
according to an embodiment of the present invention. Sam- 
pling may occur by rate matching as represented by 220. In 
one example, FIG. 2 represents an echo canceller for 
applications where the receive rate is higher than the trans- 
mit rate. A transmit filter 210 may receive a transmit signal 
and generate a filtered transmit signal. The output of the 
transmit filter 210 may be coupled to an analog front end or 
other device. A copy of an echo signal associated with the 
transmit signal may be up-sampled by Up Sampling block 
212 to match the sample rate to the receive rate. The 
up-sampled signal may then be received by Echo Cancel- 
lation Filter 214 to produce an output at the receive rate. The 
resulting output may be subtracted from a received signal 
filtered by Receive filter 216. Conventional echo cancellers 
generally operate the ECF at the higher receive rate. As a 
consequence, the ECF, which may be a Finite Impulse 
Response (FIR), may require a larger number of taps to span 
a fixed length of time. As a result, extra hardware or software 
complexities will result. In accordance with the present 
invention, Echo Cancellation Filter 214 may include the 
inventive aspects of the invention, as discussed in detail 
below. 

[0176] FIG. 3 illustrates a block diagram of a Reduced 
Complexity Dual Rate Echo Canceller (RCDR-EC), accord- 
ing to an embodiment of the present invention. In particular, 
FIG. 3 illustrates a case where the receive rate is higher than 
the transmit rate. As shown in FIG. 3, a transmit filter 310 
may receive a transmit signal from a transmit path and 
generate a filtered transmit signal x(n). Transmit filter 310 
may operate al a transmit rate (R^J. The output x(n) of the 
transmit filter 210 may be received by an analog front end 
or other device. Echo Cancellation Filter (ECF) 312 may 
include an adaptive FIR filter that receives an input from the 
transmit path and generates a copy of an echo associated 
with the transmit signal, as shown by xl(n). ECF 312 may 
operate at transmit rate (R^). Since both the input and output 
of ECF 312 are sampled at the lower transmit rate (R^), the 
ECF 312 operates at the lower transmit rate (RtJ. 

[0177] Rate Matching Block from Transmit to Receive 
(RMB-TR) 320 up-samples the output xl(n) of ECF 312 by 
a factor a with appropriate filtering wherein ct=P/Q. The 
RMB-TR 320 may further include sub blocks, such as an Up 
Sampling Block (USB-P) 322 with up sampling factor P, an 
Interpolation Filter (IPF) 324, and a Down Sampling Block 
326 (DSB-Q) with down sampling factor Q. Other sub- 
blocks may be implemented. 

[0178] An output xl(n) of the ECF 312 may be received 
by the USB-P 322. An up sampling function at the USB-P 
322 may be achieved by a zero filling operation, e.g., the 
USB-P inserts P-l zeros between consecutive input signal 
samples. Other operations for achieving up sampling may be 
implemented. Thus, USB-P 322 may generate samples P 
times faster than the input sample rate R^. 

[0179] An output x2(l) of the USB-P 322 may be received 
by the IPF 324. The IPF 324 may include a fixed FIR filter 
that interpolates the up-sampled signal. Essentially, IPF 324 
may serve as a low pass filter for filtering out most or all P-l 
replicas of the transmit signal generated from the up sam- 
pling function. The IPF 324 may operate at P times the 
transmit rate R^ or other variation thereof. 



[0180] An output x3(l) of the IPF 324 may be received by 
the DSB-Q 326. The DSB-Q 326 may down sample the 
output of the IPF 324 by a factor Q by dropping Q-l samples 
from the input data stream and generating a signal at the 
receive rate R^. 

[0181] An incoming receive signal from an Analog Front 
End may be received by Receive Filter 314, which operates 
at a receive rate R^. Receive Filter 314 may then generate 
a filtered receive signal y(m). The output x4(m) of the 
RMB-TR 320 may then be subtracted from the filtered 
receive signal y(m) from Receive Filter 314 at summer 316 
to generate a residue echo signal e^m). 

[0182] Rate Matching Block from Receive to Transmit 
(RMB-RT) 330 may down sample the residue echo signal 
e r (m) by a factor a (or other factor) with appropriate filtering. 
Similar to RMB-TR 320, the RMB-RT 330 may include sub 
blocks, such as Up Sampling Block (USB-Q) 332 with up 
sampling factor Q, an An ti- Aliasing Filter (AAF) 334, and 
a Down Sampling Block (DSB-P) 336 with down sampling 
factor P. 

[0183] The residue echo signal e r (m) may be received by 
the USB-Q 332. The up sampling at the USB-Q 332 may be 
achieved by a zero filling operation, e.g., the USB-Q 332 
inserts Q-l zeros between consecutive input signal samples. 
Other operations for achieving up sampling may be imple- 
mented. Thus, USB-Q 332 may generate samples Q times 
faster than the input sample rate 

[0184] An output el(l) of the USB-Q 332 may be received 
by the AAF 334. The AAF 334 may include a fixed low pass 
FIR filter that receives the up-sampled output of the summer 
316 and filters out most or all the high frequency signals 
above the transmit band before feeding it to a down- 
sampling block. The AAF 334 may operate at Q times the 
receive rate R^. 

[0185] An output e20) of the AAF 334 may be received by 
the DSB-P 336. The DSB-P 336 may down sample the 
output of the AAF 334 by a factor P (or other factor) by 
dropping P-l samples from the input data stream and 
generating an error signal e(n) at the transmit rate R^. 

[0186] In one particular embodiment, for RCDR-EC of the 
present invention, the up sampling is accomplished after the 
ECF 312, as opposed to the conventional approach where up 
sampling is done before EFC 312. 

[0187] The transmit signal x(n) may first be filtered by a 
N-tap FIR ECF where 

( W l> W 2> W * - W n}> 

[0188] represents N adaptable ECF filter coefficients. The 
output of the ECF may be written as 



k-l 



[0189] where x^ is the data vector 

XJ^D), X(»-1), .(D-2), . . . <D-N + 1)F, 

[0190] w is the coefficient vector, 

[0191] and T denotes the transpose operation. 
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[0192] According to another embodiment of the present 
invention, the RMB-TR 320 may up-sample xl(n) by a 
factor a. For example, for every P input samples the 
RMB-TR 320 may generate Q output samples where a=P/Q. 
As discussed above, the up sampling at the USB-P322 may 
be achieved by a zero filling operation. Other operations for 
achieving up sampling may be implemented. The output 
x2(l) of the USB-P 322 may be expressed as 



[0202] The up-sampled signal el(l) may be filtered by the 
AAF 334, which may be a FIR filter, to generate the filtered 
version e2(l), which may be expressed as 



(*/(*) i( l=Pn 
0 otherwise 



[0193] In other words, x2(l) may include the samples of 
xl(n) with P-1 zeros inserted in between, e.g., samples of 
x2(l) may be given as follows: 



— 0. xl(n - IX 0, 0, — . 0 > xl{n\ 0, •■■ 

P-1 area 



[0194] Different time indices may be used to represent 
different sampling rates, e.g., n, 1, and m may be used to 
represent rates PR^ (or equivalently, QR„)> and R^, 
respectively, for example. 

[0195] The output of a N f tap FIR IPF 324 may be given 
by 



[0196] where 

{f to f» k - - - W 
[0197] represent the N f IPF coefficients. 

[0198] The output x3(l) of the IPF 324 may be down 
sampled using DSB-Q 326 where the output may be repre- 
sented by 

x4(m)-x3(l)> where 1-Qm. 

[0199] In other words, x4(m) may include decimated 
samples of x3(n), where the samples of x4(n) may be written 
as 

. . . , *KQG-1)), x3(Ql), x3«Kl+l)), . . . 

[0200] The output of RMB-TR 320 may be subtracted 
from the incoming receive signal y(m) from 314 to generate 
the residue echo signal e^m), which may be defined as 

[0201] The residue echo signal e r (m) may be up-sampled 
by USB-Q 332 by zero filling (or other operation) where the 
output may be given by 

tf/(/) _j*XmX if t = Qm 
1 0 otherwise 



[0203] where 

{&» &2> 6* - * - Sn 8 j 

[0204] are a Ng AAF coefficients. 

[0205] The output ^(l) of the AAF 334 may be down 
sampled using DSB-P 336 to generate the error signal e(n) 
which may be defined as 

e(n)-e2(iX where 1-Pn. 
[0206] IPF 324 and/or AAF 334 may include low pass 
filters that filter out the high frequency signals above the 
transmit band. According to an embodiment of the present 
invention, the same (similar or related) coefficients may be 
used for both IPF and AAF, e.g., f k =g k for all k. Also, the 
samples of the error signal may be generated at the transmit 
rate R^. The error signal e(n) may then be used to adaptively 
train the coefficients of ECF 312, as discussed below. 

[0207] To describe the LMS update rules for the RCDR- 
EC structure of the present invention, the error signal e(n) 
may be expressed in terms of the transmit signal x(n) and the 
receive signal y(m) as well as IPF and AAF coefficients. For 
example, the error signal e(n) may be written as 

e(o)-y I (n)-h'% 1 W, 

[0208] where y r (n) is the received signal component after 
USB-Q 332, AAF 334, and DSB-P 336, w is the ECF 
coefficient vector, 

jt(n) Jt(n-1) — + 

x(n-l) M.n-2) •-■ x(n-N) 
*• : : : : 

jc(n-L+l) xHft-L) — x\/i - L- N ±2) 



[0209] is the LxN input data matrix, and 

hfhj, h* h* . . . hj , 

[0210] is the filter coefficients vector that represents the 
combined effect of IPF 324 and AAF 334 along with the 
effects of USB-P 322, DSB-Q 326, USB-Q 332, and DSB-P 
336. 

[0211] The DSB-Q 326 and USB-Q 332 may have a 
sub-sampling effect on the IPF 324 and AAF 334 coeffi- 
cients, e.g., the equivalent filters may include one of the 
possible Q phases of the original filters. For example, f k Q 
may denote one of possible Q phases of the IPF filter 324, 
where 

AV Q0t -, Ml ,for^l,2,...N f Q , 

[0212] In addition, x 1» 2, 3, . . . , Q denotes the selected 
phase where N f ° is the smallest integer not less than N/Q. 
Similarly, for the AAF coefficients, 
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[0213] where % 1, 2, 3, . . . , Q denotes the selected 
phase and N g Q is the smallest integer not less than Ng/Q. 

[0214] The (N f Q +N g Q -l) coefficients of the combined 
filter response of the single phase IPF 324 and AAF 334 may 
be represented by b v For example, b, may be obtained by 
convolving f k Q and g, Q as follows: 



for 1=1, 2. ....*?+«?-!. 



zeros except one non-zero element (e.g., ±1). Without loss of 
generality, the largest magnitude element of h may be 
positive and the matrix-vector product may reduce to a 
delayed version of the input vector, e.g., 



X„ r A = 



x(n-d-l) 



[0215] From the combined filter coefficients b a , the ele- 
ments of h may be easily obtained. The effect of USB- P 322 
and DSB-P 336 may include the selection of a phase out of 
possible P phases. In other words, 

Ai-*pa-i>^ for M, 2, .... Z, 
[0216] where p x 1, 2, 3, . . . , P denotes the selected phase 
and L is the smallest integer not less than (N f Q +Ng Q -l)/P. 
Though the exact value of p, q ly and q 2 may not significantly 
impact the performance, the a priori knowledge of these 
integers may be used to calculate h. In the equations, the 
value in parenthesis generally refers to the time and vari- 
ables in bold generally refer to arrays (e.g., vec(n) refers to 
values of the array "vec" at time n). 

[0217] Once the expression for the error signal e(n) is 
obtained, the LMS update rule may be given as follows: 

[0218] where p is the LMS step size. Time subscript for w n 
may represent the LMS iteration. 

[0219] At a first glance, it may appear that the proposed 
LMS update requires (L+1)N multiplications per iteration 
where the factor L is due to the vector-matrix product X^h. 
However, due to the shift structure of e.g., X„ may be a 
Hankel (or other) matrix, per iteration, N multiplications 
may be used instead. Hence, the over all computational 
complexity of the LMS for RCDR-EC may be 2N multipli- 
cations per iteration. The multiplication required may be 
twice that of usual LMS which in comparison requires N 
multiplications. As a result, there is a savings of filter 
coefficients by, at least, a factor a. Thus, the computational 
complexity of the proposed LMS for RCDR-EC of the 
present invention is lower for a>2. 

[0220] According to another embodiment of the present 
invention, a simplified LMS may be implemented to reduce 
computational complexities by using an approximation on h. 
In one example, IPF 324 and/or AAF 334 may include low 
pass filters. Hence, the impulse responses may be similar to 
a sine function. In other words, the impulse response has a 
dominant peak and diminishes in magnitude away from the 
peak. Moreover, the convolution of two such filters may also 
exhibit a similar behavior. Variations may be made to h. For 
example, the largest element of h may be replaced with its 
sign. In another example, some or the rest of the elements 
may be set to zero (or other value). Replacing the largest 
magnitude element by its sign may result in a scaling of h 
where such scaling may be absorbed in ft. In addition, no 
approximation may be involved in this step. After such 
scaling, the remaining elements may have a magnitude less 
than one, which may then be approximated to zero. With this 
approximation, the coefficient vector h becomes a vector of 



[0221] where B is the approximate h and d is the index of 
the largest element of h. 

[0222] Using this approximation, the LMS update may 
simplify to, 

[0223] where x,, is the data vector defined before as 

x^x{n\ ^1), jr(n-2), . . . x{n-N + l)Y. 

[0224] Hence, the simplified LMS update for RCDR-EC 
may be similar to the normal LMS update except for the 
delay. In other words, the computational complexity of the 
simplified LMS for RCDR-EC is similar to that of usual 
LMS. 

[0225] Since both IPF and AAF are part of the system 
design, the approximation does not restrict the use of the 
simplified LMS. The IPF and AAF may be defined such that 
the resulting combined filter has a single dominant coeffi- 
cient. Factors that may degrade the convergence and per- 
formance of LMS may include multiple dominant peaks in 
the combined filter response and inexact setting of the delay 
d, for example. 

[0226] Since up sampling in the USB-P may be performed 
by zero filling, the sub blocks of RMB-TR 330 may be 
combined for efficient poly-phase implementation. The 
poly-phase implementation of RMB-TR may require 
RJMf/P (or equrvalently, R^^Q) multiplications per sec- 
ond. Efficient poly-phase structure may also be used for 
RMB-RT 330 and the number of multiplications that may be 
required per second is R^JNJQ. 

[0227] Various comparisons may be made between a con- 
ventional echo canceller and the RCDR-EC of the present 
invention for the case a>l, for example. 

[0228] As for structural differences, a conventional echo 
canceller is generally composed of an up sampling block 
followed by an adaptive FIR ECF that operates al the higher 
receive rate R^. In contrast, in the RCDR-EC approach of 
the present invention the adaptive FIR ECF may operate at 
the lower transmit rate R^. A rate matching block RMB-TR 
330 may be used to generate echo samples at the receive rate 
and another rate matching RMB-RT 320 may be used to 
calculated the error signal at the transmit rate. 

[0229] As for filter lengths, to capture the same (or simi- 
lar) fixed amount of time span (e.g., echo tail), the RCDR- 
EC of the present invention may use N filter taps as 
compared to the aN taps required by the conventional echo 
canceller approach. However, in addition to the ECF, 
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RCDR-EC of the present invention may implement addi- 
tional filters, which may include IPF and AAF. In general, 
the ECF is longer than the combined length of IPF and AAF. 
Hence, there is a significant reduction in the number of filter 
taps. For example, for a typical value of N=256 and a=8, the 
conventional filter requires approximately 2048 taps. 
According to an example of the present invention, even with 
two 64 taps IPF and AAF filters, the number of taps for 
RCDR-EC is approximately 384, significantly less than that 
required by conventional systems. 

[0230] As for LMS training computations, the computa- 
tional complexity may be approximately equal (or similar) 
for the conventional LMS and simplified LMS for RCDR- 
EC of the present invention. However, since the conven- 
tional ECF is a times longer than that of RCDR-EC of the 
present invention, the number of multiplications required for 
LMS update may be a times higher for conventional echo 
canceller. Due to the reduction in the filter taps, LMS for 
RCDR-EC of the present invention has lower complexity 
then the conventional approach for ct>2, according to one 
example of the present invention. 

[0231] As for steady state computational complexity, in 
steady state using efficient poly-phase implementation the 
total number of multiplications per second that may be 
required by RCDR-EC is (N+Nf/Q+N^Q)!^. In contrast, 
even with efficient poly-phase implementation, the number 
of multiplications per second required by the conventional 
echo canceller is at least aNR^. In a typical application, the 
total filter length of IPF and AAF may be considered small 
compared to the length of the ECF filter, e.g., Nf+N g <N. 
Hence, RCDR-EC of the present invention may reduce the 
steady state computation by at least a factor of a 2 . 

[0232] As for improved noise floor level and overall 
performance, in RCDR-EC the AAF filters out signals above 
a transmit band before generating the error signal e(n). Thus, 
the error signal has a reduced contribution from the wide 
band receive signal which acts as noise for echo cancellers. 
In a conventional echo canceller, the receive signal, even the 
part outside of the transmit band, contributes to the error 
signal. Hence, RCDR-EC of the present invention has a 
lower noise floor that results in reduced convergence time 
and improved overall performance of the LMS training 
algorithm. Other comparisons as well as advantages may be 
recognized by the present invention and variations thereof. 

[0233] FIG. 4 is a block diagram of a Reduced Complex- 
ity Dual Rate Echo Canceller (RCDR-EC), according to an 
embodiment of the present invention. The block diagram of 
FIG. 4 may support applications, such as ADSL CO, where 
the receive rate is tower than the transmit rate, e.g., a<l. 

[0234] Transmit Filter 410 may receive a transmit signal 
from a transmit path and generate a filtered transmit signal 
x(d). The filtered transmit signal x(n) may be an input to an 
Analog Front End or other device. 

[0235] Rate Matching Block (RMB) 420 may up-sample 
the input signal by a factor a (or other factor) with appro- 
priate filtering. The RMB 420 may include sub blocks, such 
as an Up Sampling Block with up sampling factor P (USB- 
P) 422, an Anti-Aliasing Filter (AAF) 424, and a Down 
Sampling Block with down sampling factor Q (DSB-Q) 426. 

[0236] The filtered transmit signal x(n) may be received 
by the US B P 422. The up sampling at the USB-P 422 may 



be achieved by a zero filling operation, e.g., the USB-P 422 
may insert P-l zeros between consecutive input signal 
samples. Other operations for achieving up sampling may be 
implemented. Thus, USB-P 422 generates samples P times 
faster than the input sample transmit rate R^. 

[0237] An output xl(n) from the USB-P 422 may serve as 
an input to the AAF 424. The AAF 424 may include a fixed 
low pass FIR filter that receives the up-sampled signal xl(n) 
and filters out most or all the high frequency signals above 
the receive band before feeding it to a down-sampling block. 
AAF 424 may operate at P times the transmit rate R^. 

[0238] An output x2(n) from the AAF 424 may serve as an 
input to the DSB-Q 426. The DSB-Q 426 may down sample 
the output of the AAF 424 by a factor Q by dropping Q-l 
samples from the input data stream and generate the output 
signal z(m) at the receive rate R^. 

[0239] Echo Cancellation Filter (ECF) 428 may include an 
adaptive FIR filter that receives the down sampled signal 
z(m) and generates a copy of the echo. Since both the input 
and output of the ECF 428 are sampled at the lower receive 
rate (R„), the ECF 428 operates at the receive rate R^. 
According to an embodiment of the present invention, 
RCDR-EC provides down sampling before the ECF, 
wherein ECF operates at the lower receive rate R^.. 

[0240] The transmit signal x(n) may be up-sampled and 
filtered by the FIR AAF 424 before being down sampled by 
the DSB-Q 426. The AAF 424 filters out most or all high 
frequency signals above the receive band. The down 
sampled signal at the output of the DSB-Q 426 may be 
denoted by z(m). The down sampled signal z(m) may then 
be filtered by the ECF 428 where the output of ECF 428 may 
be expressed as 



N 

zl(jn) = ^z(m-Jt + I)H* ~zLw y 



[0241] where is the data vector 

zjz(m), z(m-l), z(m-2X . . - z(m-N+l)f, 

[0242] w is the ECF coefficient vector, 

[0243] A receive signal from Analog Front End may be 
filtered by Receive Filter 430. The output zl(m) of ECF 428 
may then be subtracted from the incoming receive signal 
y(m) to generate error signal e(m), e.g., 

c<mH<m)-zl(/n). 

[0244] The samples of the error signal may be generated 
at the rate R^. This error signal is then used to adaptively 
train the coefficients of ECF 428 as described in the follow- 
ing section. 

[0245] According to an embodiment of the present inven- 
tion, an ECF input z(m) may not be the transmit signal but 
rather a down sampled version. Thus, to derive the LMS 
update rule for the RCDR-EC structure of the present 
invention, the error signal e(m) may be expressed in terms 
of the down sampled signal z(m), the receive signal y(m), 
and ECF coefficients as follows: 
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[0246] where y(m) is the received signal, z,, is the data 
vector, and w is the ECF coefficient vector as previously 
discussed. 

[0247] Once the error signal e(n) has been defined, the 
LMS update rule may be given as follows: 

[0248] where /i is the LMS step size. Time subscript for 
w^ may represent the LMS iteration. 

[0249] The above LMS update rule of the present inven- 
tion may be similar to a LMS update except for the fact that 
the data vector is the down sampled version of the transmit 
signal x(n), for example. Hence, the computational com- 
plexity of LMS update for RCDR-EC may be similar as that 
of the conventional LMS. 

[0250] According to another embodiment of the present 
invention, a dual rate echo canceller for simultaneous sup- 
port of multiple annexes of G.992.1 (or ITU G.dmt) is 
provided. Other annexes and variations thereof may also be 
supported. The present invention provides an efficient 
Reduced Complexity Dual Rate Echo Chancellor (RCDR- 
EC) architecture for ADSL CPE for simultaneous support of 
a plurality of annexes, such as Annex A and Annex B of 
G.992.1. In addition, low complexity LMS update rules are 
proposed for adaptive training RCDR-EC. 

[0251] The RCDR-EC architecture and LMS update rules 
of the present invention provide various features and advan- 
tages. The RCDR-EC structure of the present invention may 
be designed for digital echo cancellation for full rate ADSL 
at the CPE, for example. The proposed structure of the 
present invention may also support a plurality of annexes, 
such as at least both Annex A and Annex B of ITU Standard 
G.992.1, also known as G.dmt. Other standards, including 
wire-line technology, may be supported for added versatility 
and improved functionality. In addition, RCDR-EC of the 
present invention may operate at the transmit rate, the lower 
of the two rates, thereby requiring less computation per data 
sample. 

[0252] According to one example, the present invention 
provides an echo canceller structure for full rate ADSL CPE. 
Another aspect of the present invention is directed to design- 
ing the echo canceller flexible enough to simultaneously 
support, at least, Annex A and Annex B of G.992.1. Another 
aspect of the present invention is further directed to obtain- 
ing an efficient rate matching echo canceller implementation 
for full rate ADSL CPE as well as other applications. 

[0253] For example, the ADSL system, as specified in ITU 
standard G.992.1, may require full-duplex transmission over 
a single wire line thereby necessitating the separation of 
transmit and receive signals at the ADSL transceivers, such 
as the CPE. Typically, a passive analog circuit, e.g., a hybrid 
circuit, may be used for this purpose. In any realistic 
implementation, the transmit signal often leaks into the 
receive path whenever perfect balance is not achieved in the 
hybrid circuit The part of the transmit signal that leaks into 
the receive path is generally referred to as the echo signal. 
Even though a four-to-two wire conversion circuit has 
worked well for telephony application, the echo signal, if not 
adequately suppressed, may severely degrade the perfor- 
mance of full-duplex ADSL systems. Digital echo cancellers 
are commonly used at the ADSL transceiver to suppress the 
echo to an adequate level and to recover the performance 
loss due to the echo. 



[0254] ADSL systems support asymmetric transmit and 
receive data rates, where the echo canceller (EC) imple- 
mented in the system may be required to accommodate the 
rate difference. For ADSL systems, the upstream rate is 
generally higher than the downstream rate. In other words, 
for ADSL customer premises equipment the receive rate is 
generally higher than the transmit rate. In accordance with 
specifications recommended by the full rate ADSL ITU 
standard G.992.1, for example, the following may apply: 

[0255] where M is an integer larger than 1, R^ and R^ are 
the sample rates of receive and transmit signals, respec- 
tively. The transmit bandwidth requirements may be differ- 
ent for region specific Annexes, e.g., Annex A and Annex B 
of G.992.1, for example, and the value of M may depend on 
the particular Annex and/or other factors and conditions. 

[0256] The present invention provides a detailed imple- 
mentation of Reduced Complexity Dual Rate Echo Cancel- 
ler (RCDR-EC) structures for simultaneous support of, at 
least, Annex A and Annex B at the ADSL CPE, for example, 
as well as other applications. In addition, LMS update rules 
may be implemented for adaptive training of the RCDR-EC 
of the present invention. 

[0257] Annex selector 518 may include a number of 
switches (e.g., three switches) for selecting a variety of 
annexes (e.g., Annex A or B). As shown in FIG. 5, by way 
of example, the switch position A and B denotes the respec- 
tive Annexes. Other selections and/or options may be imple- 
mented. 

[0258] Echo Cancellation Filter (ECF) 520 may include an 
adaptive FIR filter that receives the up-sampled transmit 
signal x(n) and generates a copy of the echo, as shown by 
xl(n). As both the input x(n) and output xl(n) of the ECF 
520 may be sampled at the lower transmit rate (RJ, the ECF 
filter 520 may operate at the transmit rate R^. 

[0259] Output up sampling block (USB-O) 522 may 
receive the output xl(n) of the ECF 520 and up-sample xl(n) 
by a factor of M. For example, the value of M may be 8 for 
Annex A and 4 for Annex B, where the selection may be 
made via annex selector 524. Other values of M may be 
assigned for other options. Other variations may be imple- 
mented. The up sampling function may be achieved by zero 
filling or other operation. 

[0260] The Interpolation Filter (IF) 526 may be a fixed 
FIR filter that interpolates the up-sampled signal x2(m). 
Essentially, the IF 526 may be a low-pass filter that filters out 
most or all M-l replica of the transmit signal generated due 
to up sampling. The IF 526 may operate at the receive rate 
R„ and generate an output x3(m) at the receive rate R^. The 
output x3(m) of the IF 526 may be subtracted from an 
incoming receive signal y(m) to generate a residual echo 
signal e,(m). 

[0261] Anti-Aliasiog Filter (AAF) 532 may include a fixed 
low pass FIR filter that receives the residual echo signal 
e r (m) and filters out most or all the high frequency signals 
above the transmit band before feeding the residual echo 
signal e/m) to the down-sampling block (DSB) 534. AAF 
532 may operate at the receive rate R„. 

[0262] Down Sampling Block (DSB) 534 may down 
sample the output e^m) of the AAF 532 by a factor M and 
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generate an error signal e(n) at the transmit rate R^. For 
example, the value of M may be 8 for Annex A and 4 for 
Annex B, where the selection may be made via annex 
selector 536. Other values for M may be implemented for 
other annexes. 

[0263] For RCDR-EC of the present invention, the up 
sampling by a factor M (e.g., as performed by USB-O 522) 
may be performed after the ECF 520 as opposed to the 
conventional approach where up sampling is performed 
before the EFC 520. 

[0264] According to another embodiment of the present 
invention, the basic signal flow within RCDR-EC may be 
accomplished as follows: The transmit signal x(n) may be 
first delayed and then up-sampled. In the absence of the 
delay block 514, the ECF 520 may contain a leading 
sequence of zero coefficients, which may result in an inef- 
ficient use of the filter coefficients. The leading zero coef- 
ficients may attempt to model the delay of the echo signal 
with respect to the transmit signal caused by various ele- 
ments in the transmit path and the twisted pair line. The 
delay block 514 may be added to compensate for this delay 
of the echo signal and, hence, alleviate the burden for the 
ECF 520 to model the delay. According to an example of the 
present invention, the delay block 512 is not an absolute 
requirement but may be added for improved efficient use of 
the ECF coefficients. 

[0265] For example, the highest sub-carrier for Annex A 
and B may be approximately around 138 and 276 kHz, 
respectively. As a result, even though the width of the 
transmit bands may be identical (or similar), Annex B 
transmit signal may require approximately twice the sam- 
pling rate for a passband sampling structure. The input up 
sampling block (USB-I) 516 along with the annex selector 
518 may be used to accommodate various sampling require- 
ments. The output of the IFFT 510 may be sampled at a fixed 
rate, e.g., 276 ks/s (kilo samples/sec), for example. After the 
USB-I 516, the transmit sample rates (Rtx) may be approxi- 
mately equal to 552 ks/s and 1.104 Ms/s for Annex A and B, 
respectively. The up sampling function may be performed by 
inserting zeros (e.g., 1 for Annex A and 3 for Annex B) in 
between consecutive samples of the input signals. Other 
operations may be used to perform the up sampling function. 

[0266] The up-sampled transmit signal may then be fil- 
tered by a N-tap FIR ECF 520, where x(n) may represent the 
up-sampled transmit signal and {w l7 w^ w 3 , . . . w N }, may 
represent N adaptable ECF coefficients. The output of the 
ECF 520 may then be written as: 



[0267] where ^ is the data vector at time n represented by: 

xJx(nX x(n-l), x(n-2), . . . 
[0268] w is the coefficient vector represented by: 

[0269] and T denotes the transpose operation. 

[0270] The USB-O 522 may up-sample xl(n) by a factor 
M. In other words, for every input sample, the USB-O 522 



may generate M output samples. As mentioned before, the 
value of M may depend on (or be related to) the Annex type 
and/or other factors, e.g., 

(8, for Annex A 
4, for Annex B. 

[0271] The up sampling function may be achieved by a 
zero filling operation (or other operation) and the output of 
the USB-O 522 may be expressed as: 

(xl{n), if m-Mn 
{ 0 otherwise 

[0272] In other words, x2(m) may include samples of 
xl(n) with M-l zeros inserted in between. For example, 
samples of x2(m) may be expressed as follows: 



•» 0, xl(n - 1), 0, 0, — ,0 , xt(n\ 0, — 

M-l inuo 



[0273] Different time indices (e.g., n and m) may be used 
to represent samples with different rates. For Annex A, n 
may denote a sampling rate of approximately 5S2 ks/s and 
for Annex B approximately 1.104 Ms/s. For both Annexes, 
m may denote a sampling rate of approximately 4.416 Ms/s. 

[0274] The output of N f Up FIR IF 526 may be represented 
by: 



x3{m) = Y k x2{m-k + \)f k , 
k=i 



[0275] where 

{f b f»f,,...f M J 
[0276] are N f IF 526 coefficients. 

[0277] The output of IF 526 may be subtracted from the 
incoming receive signal y(m) to generate the residual echo 
signal, e.g., 

e,(/nyy{m')-x3(my 

[0278] The residual echo signal e r (m) may be filtered by 
the FIR AAF 532 to generate a filtered version e^ which 
may be expressed as: 



1=1 

[0279] where 

{gi> fora) 

[0280] are Ng AAF 532 coefficients. 
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[0281] The output of the AAF 532 may be down sampled 
using DSB 534 by a factor of M to generate the error signal 
e(n). In this case, the error signal e(n) may be expressed as: 

e(o)=c rt (mX where m=Mn. 

[0282] In other words, e(n) may include decimated 
samples of e^n) where the samples of e(n) may be written 
as 

.... ^(0-1)), c^Mn), erf (M(n+l)), . . - 

[0283] The value of M (e.g. , 8 or 4) may be selected by the 
Annex selector 536. 

[0284] According to another embodiment of the present 
invention, the error signal may be used to adaptively train 
the coefficients of ECF 520. 

[0285] To describe the LMS update rules for RCDR-EC 
structure of the present invention, the error signal e(n) may 
be expressed in terms of the transmit signal x(n), the receive 
signal y(m), and IF 526 and AAF 532 filter coefficients. The 
error signal e(n) may be written as: 

[0286] where y r (n) may be the received signal component 
after AAF 530 and DSB 532, w is the ECF 520 coefficient 
vector, 



jt(n) An -I) 
x{n - 1) x{n- 2) 



x(n-N + l) 



jctn-L+1) x{n~L) --■ x(n-L-N + 2) 



[0287] is the LxM input data matrix, and 

hlh^ h 3 , . . . hj 1 ", 

[0288] is the filter coefficients vector that represents a 
combined effect of IF 526 and AAF 532 along with effects 
of USB-0 522 and DSB 534. The (Nj+Ng-1) coefficients of 
the combined filter response of IF 526 and AAF 532 may be 
represented by b v For example, b x may be obtained by 
convolving f a and g 1 as follows: 



* = for 1 = 1.2,.... JV /+ Af f -l. 



[0289] From the combined filter coefficients b lf the ele- 
ments of h may be easily obtained. The effect of USB-O 522 
and DSB 534, in one respect, is to select a phase out of 
possible M phases. In other words 

Af-Won- for W, 2, .... 1, 

[0290] where p x{0, 1, 2, ... , M-l} denotes the selected 
phase and L may include the smallest integer not less than 
(Nf+Ng-iyM. Though the exact value of p may not signifi- 
cantly impact the performance, an a priori knowledge of p 
is useful in calculating h. 

[0291] The LMS update rule may be defined as follows: 

[0292] where fi is the LMS step size. The time subscript 
for w D may represent the LMS iteration. 



[0293] At a first glance, it may appear that the proposed 
LMS update requires (L+1)N multiplications per iteration. 
The factor L may be due to the vector-matrix product X^h. 
However, due to the shift structure of where X„ is a 
Hankel matrix, for example, per iteration, N multiplications 
may be required. Hence, the over all computational com- 
plexity of the LMS for RCDR-EC of the present invention 
may be 2N multiplications per iteration. The multiplication 
required may be approximately twice that of usual LMS 
which in comparison requires only N multiplications. Since, 
there is a saving of filter coefficients by a factor M, the 
computational complexity of the proposed LMS for RCDR- 
EC of the present invention is lower for at least both values 
of M-4 or 8. 

[0294] A simplified LMS may be used to reduce compu- 
tational complexities by using an approximation on h. 

[0295] As discussed above, the IF 526 and/or AAF 532 
may include low-pass filters. Generally speaking, the 
impulse responses may be similar to the sine function. In 
other words, the impulse response may exhibit a dominant 
peak and diminish in magnitude away from the peak. 
Moreover, the convolution of the two such filters may also 
exhibit a similar behavior. Other types of filters may be used 
for IF and/or AAF, in accordance with the present invention. 

[0296] For example, both IF and AAF may include low- 
pass filters for filtering out the high frequency signals above 
the transmit band. According to one example, the same 
(similar or related) coefficients may be used for both IF and 
AAF, e.g., f k =g k for all k. For an Annex B application, a 
40- tap low pass filter design may be used for at least one of 
IF and AAF, for example. The impulse response of the 
40-tap filter along with the time-domain convolution of the 
two filters are illustrated in FIGS. 6 and 7. FIGS. 6 and 7 
show that the resulting filter after convolution exhibit the 
properties discussed above. In particular, FIG. 6 illustrates 
an impulse response of IF and AAF. FIG. 7 illustrates a 
convolution of IF and AAF. 

[0297] According to an embodiment of the present inven- 
tion, the largest magnitude element of h may be replaced 
with its sign. Also, the rest of the elements may be set to 
zero. Replacing the largest element with its sign may involve 
the scaling of h where such scaling may be absorbed in fi. 
Accordingly, there may be little or no approximation 
involved. After such scaling, the remaining elements having 
a magnitude less than one may be approximated to zero. 
With this approximation the coefficient vector h becomes a 
vector of zeros except one non-zero element (±1). Without 
loss of generality, the largest magnitude element of b may be 
assumed to be positive. The matrix-vector product may be 
reduced to a delayed version of the input vector, e.g., 



XLh = 



Jc(n-J-1) 
An-d-N + l) 



[0298] where 6 is the approximate b and d is the index of 
the largest element of h. 
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[0299] Using this approximation, the LMS update may 
simplify to, 

[0300] where ^ is the data vector defined before as 

[0(301] Hence, the simplified LMS update for RCDR-EC is 
similar to the normal LMS update except for the delay. In 
other words, the computational complexity of the simplified 
LMS for RCDR-EC of the present invention is similar to that 
of usual LMS. 

[0(302] Since both IF and AAF are part of the system 
design, the approximation does not restrict the use of the 
simplified LMS. Thus, the IF and AAF may be designed 
such that the resulting combined filter has at least a single 
dominant coefficient. Factors that may severely degrade the 
convergence and performance of LMS may include multiple 
dominant peaks in the combined filter response and inexact 
setting of the delay & 

[0303] The USB-O 522 with the zero filling structure may 
be combined with the IF 526 and efficiently implemented 
using a poly-phase structure, for example. The same may 
apply for the DSB 534 and AAF 532. 

[0304] Differences between conventional echo canceller 
and RCDR-EC of the present invention may include various 
factors, such as structural differences, filter lengths, compu- 
tation for LMS training, stead-state computational com- 
plexities and noise floor levels, for example. 

[0305] As for structural differences, a conventional echo 
canceller may include an up sampling block followed by the 
adaptive FIR ECF that operates at the higher receive rate 
R^. In the RCDR-EC approach of the present invention, an 
adaptive FIR ECF may operate at the lower transmit rate R^. 
In addition, a USB and a IF may be used to generate the echo 
samples at the receive rate and a DSB and an AAF may be 
used to calculate the error signal at the transmit rate. 

[0306] As for filter lengths, to capture the same (or simi- 
lar) fixed amount of time span (e.g., echo tail), the RCDR- 
EC of the present invention may include N filter taps as 
compared to the MN taps required by the conventional echo 
canceller approach. However, in addition to the ECF, 
RCDR-EC may implement at least two more filters, e.g., IF 
and AAF, for example. In general, the ECF is longer than the 
combined length of IF and AAF and hence, there is a 
significant reduction in number of filter taps. For example, 
for a typical vafue of N-256 and M-8, the conventional filter 
may require approximately 2048 taps. On the other hand, 
even with two 64 taps IF and AAF filters, the number of taps 
for RCDR-EC of the present invention is approximately 
384, substantially less than that required in other systems. 

[0307] As for LMS training computation, the computa- 
tional complexity may be similar for the conventional LMS 
and simplified LMS for RCDR-EC of the present invention. 
For example, as the conventional ECF may be M times 
Longer than that of the RCDR-EC of the present invention, 
the number of multiplications required for LMS update may 
be M times higher for a conventional echo canceller. Due to 
the reduction in the filter taps, even the exact LMS for 
RCDR-EC may have M/2 times lower complexity then the 
conventional approach. 



[0308] As for steady-state computational complexity, in 
steady-state using efficient poly-phase implementation, for 
example, the total number of multiplications per second 
associated with RCDR-EC of the present invention may be 
represented by 

(N+Nf+NgJR^. 

[0309] In contrast, even with efficient poly-phase imple- 
mentation, the number of multiplications per second 
required by the conventional echo canceller is 

[0310] In a typical implementation, the total filter length 
of IF and AAF is less than the filter length of ECF, e.g., 
Nf+N g <N. As a result, RCDR-EC of the present invention 
may reduce the steady-state computation by at least a factor 
M. 

[0311] As for noise floor level and overall performance, in 
a RCDR-EC system, the AAF filters out signals above a 
transmit band before generating the error signal e(n). Thus, 
the error signal reduces the contribution from the wide band 
receive signal which acts as noise for echo cancellers. In a 
conventional echo canceller, most or all the out of band 
receive signal works to contribute to the error signal. 
According to the present invention, this reduced noise floor 
for RCDR-EC results in reduced convergence time and 
improved overall performance of LMS. Other comparisons 
as well as advantages may be recognized by the present 
invention and variations thereof. 

[0312] According to another embodiment of the present 
invention, a weighted error echo canceller for transceivers 
with unequal bandwidths is provided. The present invention 
enables an adaptive echo cancellation filter to perform at a 
lower sample rate. 

[0313] Echo cancellers (EC) may be commonly used in 
full duplex communication systems to suppress the echo of 
the transmit signal entering the receive path. When the 
transmit and receive path have different bandwidths (hence, 
different sample rates), the echo canceller may need to 
bridge the rate difference. With conventional designs, the 
echo canceller filter operates at the higher sample rate. 
According to the present invention, an efficient echo can- 
cellation scheme, referred to as Weighted Vector Error Echo 
Canceller (WEVE-EC) is proposed for various applications, 
such as applications where the receive path has a higher 
sampling rate. The WEVE-EC architecture may be imple- 
mented along with an adaptive algorithm, which may be 
based on Least Mean Square (LMS) update rules. Various 
other adaptive algorithms may be used to train the WEVE- 
EC of the present invention. 

[0314] According to an embodiment of the present inven- 
tion, an error vector, rather than a scalar error signal, may be 
implemented for updating the echo canceller In a conven- 
tional approach, essentially all the sampling phases of the 
error signal are treated as consecutive time samples of a 
scalar signal at the receive rate. In the proposed structure of 
the present invention, most or all the sampling phases of the 
error signal may be stacked in a vector and treated as a 
vector sampled at the transmit rate. 

[0315] An Error Weighting Multi-input-multi-output Fil- 
ter (EWMF) of the present invention provides a flexible 
weighting scheme on most or all the sampling phases of the 
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error signal. The Multi-Input Multi-Output (M1MO) filter 
structure is more general than the conventional Linear 
Tune-Invariant (LTI) Finite Impulse Response (FIR) filters. 
In particular, the LTI FIR structure may be considered a 
special case of the proposed EWMF. Hence, the proposed 
structure of the present invention provides greater variety in 
selecting a training method for echo cancellers. 

[0316] According to another embodiment of the present 
invention, the combination of an adaptive filter and an 
interpolation filter enables the echo canceller to operate at 
the lower transmit sample rate. This is in contrast to the 
conventional echo canceller approach where the echo can- 
celler operates at the higher receive sampling rate. In addi- 
tion, training and updating functionality of the echo cancel- 
ler may be conducted at the same (or similar) low sample 
rate. 

[0317] The architecture of the present invention reduces 
the number of filter coefficients without loss of performance. 
As a result, hardware requirement and computational com- 
plexities are significantly reduced and an increase in con- 
vergence speed of training algorithms may be achieved. 
Other advantages may be observed from the various 
embodiments of the present invention. 
[0318] The proposed architecture may be suitable for 
steady-state tracking of the echo-canceller. The proposed 
architecture may also be easily modified for other situations, 
such as the reverse situation where the receive bandwidth is 
smaller than the transmit bandwidth. 
[0319] The present invention provides an effective echo 
cancellation method that takes advantage of the situation 
where the transmit signal, and thus the echo signal generated 
from it, has a smaller bandwidth than that of the receive 
signal. The architecture of the present invention may include 
various components and function blocks, such as an Adap- 
tive Echo Cancellation Filter (AECF) 810 which may oper- 
ate at the lower transmit sample rate, an Interpolation Filter 
Bank (IFB) 820, and an Error Weighting M1MO Filter 
(EWMF) 830. Other functions may be implemented in the 
system of the present invention. 

[0320] A corresponding LMS algorithm of the present 
invention provides training, tracking and/or other purposes. 
A filter bank design provides an efficient implementation for 
enabling some or all functional blocks to operate at the lower 
sample rate. Compared with the conventional echo cancel- 
lation filter which operates at the higher sample rate, the 
structure of the present invention offers improved perfor- 
mance, while significantly reducing the required number of 
taps, the sample rate, as well as yielding improved training 
behavior and less power consumption. Moreover, the pro- 
posed algorithm of the present invention also yields better 
convergence and stability properties during training and 
other phases. 

[0321] FIG. 8 illustrates a basic signal flow and the 
functional blocks of the WEVE-EC, according to an 
embodiment of the present invention. The Weighted Vector 
Error Echo Canceller (WEVE-EC) of the present invention 
may include various functional blocks, such as an Adaptive 
Echo Canceller Filter (AECF) 810; an Interpolation Filter 
Block (IFB) 820; an Error Weighting Multi-input-muiti- 
output filter (EWMF) 830; a Reference Signal Interpolation 
Filter (RSIF) 840; and a \fectorization Unit (VU) 850. Other 
functional blocks may also be implemented, in accordance 
with the present invention. 



[0322] The AECF 810 may include a FIR filter with N w 
coefficients and an adaptive component of the WEVE-EC. 
The AECF 810 may operate at the lower transmit sample 
rate. The N w coefficients of the AECF may be stacked into 
a vector w, where 

wfw tt W„ . . . , Wj^jJ 1 ". 

[0323] The coefficient vector w may be adapted during a 
training period and may be updated during the steady state. 
The AECF 810 may receive the transmit signal x Q and 
generate a copy of the echo signal that is fed to the IFB 820. 

[0324] The IFB 820ft, . . . , f M } may include a fixed filter 
bank for generating M branches of signals from the output 
of the AECF 810 to match a receive signal vector. The 
branches of signals may be organized into the vector y Q a 
ly^f • ■ • » yn M J r at eacD ame instance or other time interval. 
For example, an integer M may be used where the archi- 
tecture of the present invention may be easily modified with 
a scheme, such as a mixed upsample-downsample scheme, 
to accommodate various rational values of M. The output of 
the IFB 820 may include a vector signal y i i which may be 
obtained by filtering the output of the AECF 810. This signal 
may be used to generate the residual echo r n . 

[0325] The weighted error vector represents the training 
error of the adaptive echo canceller filter. The EWMF 830H 
allows a flexible weighting on the error vectors and thus 
offers a vast set of options for advanced training of the 
WEVE-EC of the present invention. 

[0326] The RSIF 840 may include a fixed filter for gen- 
erating a reference signal for training and/or other purposes. 
The filtering operation may involve a combination of the 
IFB 820 and the EWMF 830, for example. 

[0327] The transmit signal ^ may occupy a different (e.g., 
smaller) bandwidth than that of the receive signal. The 
filtered transmit signal q L j may be obtained by filtering the 
transmit signal x^. The signal x Q may then be used to 
generate the reference signal Sj. 

[0328] The reference signal s*. may be obtained by filtering 
q j. The reference signal may be used for the LMS algorithm 
during the training of the AECF 810 with coefficient vector 
w. 

[0329] The receive signal vector may be represented as 

[0330] This receive signal vector may be obtained by 
stacking the M sampling phases of the receive signal into a 
M length vector. The receive path may provide a sample rate 
M times higher than the transmit path. Hence, the vectorized 
receive signal d„ has a sampling rate equal (or similar) to the 
transmit sampling rate. 

[0331] The VU 850 may include sub blocks, such as Delay 
Units z(k) and Down Sampling blocks. Other sub blocks 
may also be incorporated by VU 850. The Delay Units sub 
block as designated by z(k) may delay a signal by K 
samples. The Down Sampling blocks as designated by M 
and a down arrow may be used to drop M -1 intermediate 
samples from the input signal. In other words, Down Sam- 
pling block decimates the input samples by a factor M. 

[0332] The residual echo vector T n =d^-y Q may be obtained 
by subtracting the output of the IFB 820 from the receive 
signal During the training period, the residual echo vector 
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r n may be used to generate the weighted error vector e n for 
the LMS update or other purpose. The weighted error vector 
e n may include the filtered residual echo vectors and may be 
used for training purposes, e.g., to train the AECF 810. Once 
the WAVE-EC is properly trained, e n may contain a small 
portion of the echo signal. 

[0333] In the steady state, operational function blocks may 
be limited to the AECF 810, the 1KB 820, and the VU 850. 
The RS1F 840 and the EWMF 830 may be implemented for 
training/tracking purpose. The signals for steady state opera- 
tion may include the echo-canceller output y n k and the echo 
cancelled signal r n k . The signals Sfc and e k may be generated 
for the adaptive LMS. 

[0334] Various echo canceller parameters for the architec- 
ture of the present invention as well as the LMS algorithm 
may be implemented as described in detail below. 

[0335] Integer M may include the ratio of the receive 
sample rate to the transmit sample rate. M>1 implies that the 
transmit sample rate is lower than the receive sample rate. 
According to the present invention, the IFB 820 may be used 
on the transmit signal and a branching operation on the 
receive signal to bridge the transmit path and the receive 
path. 

[0336] Adaptive Echo Canceller Filter 810 length N w may 
represent the number of taps in filter 810. As the architecture 
of the present invention may operate at a lower sample rate, 
the length is substantially lower than that of an echo can- 
cellation filter operating at a higher sample rate. 

[0337] EWMF coefficients may be grouped into a matrix, 
such as {h^; m=l, . . . , M, n=l, . . . , N H }. The choice of 
the coefficients may depend on the nature of the transmit 
signal, receive signal, and/or the echo signal, as well as, the 
training algorithm used to train the echo canceller. Other 
factors may also be involved in the choice of coefficients. 

[0338] The dimension of the EWMF matrix may be rep- 
resented as MxN H . For LMS algorithms, N H may be a 
multiple of M, e.g., K=N H /M is an integer. K may represent 
the number of residual echo vectors that are used to generate 
the weighted error vector. A larger EWMF matrix permits a 
more flexible weighting scheme on the error signal at the 
cost of higher complexity. 

[0339] Interpolation Filter Bank (IFB) coefficients may be 
represented by {f&; k=l, . . . , M, n=0, . . . , tVl}: A 
dedicated filter of length N f may exist for each of the M 
signal branches. These coefficients may be chosen such that 
a desired interpolation filtering performance is achieved 
with a given filter length. 

[0340] Interpolation Filter Bank length may be repre- 
sented by N f . The length may be decided by a desired 
performance of the up sampling block, the computational 
cost and/or other factors. The performance as well as the 
complexity may increase with the length of the filter. 

[0341] LMS update step size sequence may be represented 
by /4 D . Generally, the larger the step size, the faster the 
training will converge. However, there may be a limit on the 
magnitude of the step size. For example, a large enough 
value may cause the training algorithm to diverge. On the 
other hand, the smaller the value of fi^ the smaller the error 
floor. For example, the step size sequence may be chosen as 
constant or as a decreasing set of numbers such that the 



algorithm quickly achieves convergence with large step 
sizes at the early stages of the training procedure and obtains 
low error floor with smaller step sizes later in the training 
procedure. This is commonly known as a gear shifting 
method. 

[0342] The training procedure of the present invention 
may involve adapting the coefficients of the echo canceller 
filter based on up-sampled reference sequences and/or 
weighted error signal. Other factors may be considered. The 
adaptive echo canceller may be matched to the echo path 
such that a significant component of the echo component in 
the receive signal may be cancelled. The effectiveness of the 
echo cancellation may be measured by the energy of the 
weighted error signal. The tracking procedure may use the 
same (similar or related) update mechanism as that of the 
training procedure or a variation thereof. 

[0343] The training method is not limited to the LMS 
algorithm as discussed above, but other algorithms may be 
used instead of LMS, in accordance with the present inven- 
tion. The scalar form of an LMS update rule may be 
described as follows: 

[0344] where w n is the adaptive entity for the LMS algo- 
rithm, e n is an error signal, n is the time index, and fi^ is the 
LMS step size. The update rule for the proposed LMS 
algorithm is shown below in step 918. Compared with the 
scalar form, the proposed update rule of the present inven- 
tion provides a more flexible weighting scheme for the error 
signal. 

[0345] In the proposed LMS algorithm, each update may 
involve the steps of FIG. 9. At step 910, a vector corre- 
sponding to a reference filter may be formed. At step 912, a 
matrix may be formed. At step 914, a weighted error vector 
may be formed. At step 916, an adaptive echo canceller filter 
may be updated. At step 918, the weighted error signal 
energy may be calculated. The steps of FIG. 9 will be 
described in detail below. 

[0346] At step 910, vectors s n \ . . . , s n M corresponding to 
reference filter may be formed where 



4 = t44-iR*"ii r 



[0347] At step 912, the matrix may be formed as follows: 



[0348] At step 914, weighted error vector may be formed 
as follows: 



*.=t*2Ki r 



[0349] At step 916, the adaptive echo canceller filter may 
be updated as follows: 
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[0350] At step 918, the weighted error signal energy e n T e n 
may be calculated. At step 920, it may be determined 
whether the energy is less than a predetermined value. If so, 
the process may be terminated at step 922. If the energy level 
is equal to or above a predetermined value, then n=n+l and 
a feedback loop to step 910 may be established. 

[0351] The present invention also provides advantages 
with respect to computational complexity. For example, it 
may take N w +MNf+MN H multiplications to generate the 
weighted error vector e n , MNf+MN H multiplications to form 
the matrix S n , and M 2 N W multiplications to execute the 
update equation for each update of the AECF. In total, 
(I+M^N^IM^^Nh) multiplications may be required for 
each update of the AECF. 

[0352] Additional details of the LMS algorithm of the 
present invention will now be described. The output of the 
echo canceller interpolation filter bank (e.g., IFB 820) 



[0353] may be written as 

[0354] where F represents the echo canceller Interpolation 
Filter Bank, e.g., 



vector may have a significant out-of-band noise. This out- 
of-band noise has detrimental effect on the adaptation pro- 
cess and should be removed. According to an embodiment 
of the present invention, the out-of-band noise may be 
removed or minimized by passing the residual echo vector 
through the EWMF, or other filter. As discussed above, the 
residual echo signal may be represented by 

[0360] Using the residual echo signal, the weighted error 
vector may be written as 











e n = 




= H 













[0361] In the formulation, K=N FI /M residual echo vectors 
may be involved in the filtering operation to generate a 
filtered error vector. By using the matrix 



H = 





A U - 










A*u 


A<U ■ 





/I" /?'* 



Al) - 
Jftf-l 

AD 
JNJ-X 

AH) 

Jf/f-l . 



[0362] to represent the weighting operation, the possibility 
of using different filters for each sampling phase is pre- 
served. For the special case where a single K tap FIR filter 
is used for all sampling phase, the H may become a Toeplitz 
matrix with the first row 



[0355] and 



[0356] is the transmit signal matrix. 

[0357] In one example, the filter bank F may implement a 
zero-insertion phis low-pass filtering operation. In such 
cases, F may be written as 



/w>i 

fzM-i 



f{Nf-l)M 



jftt , 0 (M 2 , 0,... 

[ M-l ana M-l » 



[0363] where the K non zeros elements {h ly . . . , hjj- may 
be defined as the FIR filter coefficients. A Toeplitz matrix 
generally refers to a matrix whose entries are constant along 
each diagonal. 

[0364] After defining the necessary signals, the corre- 
sponding Least Mean Square (LMS) algorithm for the 
coefficients of the echo cancellation filter may be derived in 
accordance with the present invention. For example/ the 
instant gradient for the LMS training algorithm may be 
obtained by taking the derivative of e n T e n with respect to w. 
The update rule may then be represented as: 



[0358] where [^ . . . , f^^F may be a NjM-length 

low-pass filter. _ 

[0359] As the receive signal may occupy a larger band- W ** 1 ~ 2 dw » 

width than that of the transmit signal, the residual echo 
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-continued 



[0365] The matrix multiplication in the update rule may 
represent the operations performed in the reference signal 
interpolation filter (e.g., 840) in FIG. 8. 



mately equal to a constant where the selected element may 
be any element of the TIR vector. The error may be 
computed by using the difference between the TEQ output 
and TIR filter output. TEQ filter coefficients may be adop- 
tively computed by minimizing the Mean Square Error 
(MSE) criterion. TEQ filter obtained through the algorithm 
of the present invention may provide minimal energy Inter 
Symbol Interference (ISI) and Inter Channel Interference 
(ICI). 

[0370] The computation requirement of the algorithm of 
the present invention may be linear in equalizer length, e.g., 



[XlF T \...\Xl K ^F T \.H T = 
9^1 «i-2 — 9^2 — 



Sit 



sent back to annotation 



[0366] where the filtered output Sj l may be arranged into a 
matrix. The final form of the update rule may be represented 
as 

[0367] According to another embodiment of the present 
invention, a minimum mean square error linearly coo- 
strained fast algorithm for adaptive training of a Time 
Domain Equalizer (MLC-TEQ) is provided A fast adaptive 
algorithm of the present invention may be used to obtain 
Finite Impulse Response (FIR) filter coefficients for Time 
domain Equalizer (TEQ) used in Discrete Multitone (DMT) 
based applications, such as ADSL, for example. The TEQ 
coefficients obtained by the algorithm of the present inven- 
tion shortens the overall effective discrete time channel 
impulse response length within a given target length (e.g., 
symbol prefix length for DMT application). Advantages of 
the proposed data aided adaptive algorithm may include 
providing the TEQ filter coefficients with near-optimal per- 
formance; having low computational requirements, having 
fast convergence, and exhibiting attractive stability proper- 
ties. Other advantages may also be realized by the present 
invention and variations thereof. 

[0368] Other features of the algorithm of the present 
invention may include implementing a Target Impulse 
Response (TIR) vector, e.g., the combined response of the 
transmission channel and TEQ filter, as a search vector, e.g., 
the vector that needs to be calculated 

[0369] The algorithm of the present invention may con- 
strain a selected element of the TER vector to be approxi- 



the number of filter taps of the equalizer. As a result, the 
algorithm is suitable for practical implementation in various 
applications. The algorithm of the present invention may 
adaptively calculate an equalization delay using the training 
and received sequences thereby providing efficient use of 
FIR TEQ filter coefficients. The algorithm of the present 
invention may incorporate input data conditioning through 
use of dummy signals thereby providing enhanced stability 
and improved convergence. 

[0371] The present invention provides a fast, stable and 
high performance adaptive algorithm for computation of 
near-optimal FIR TEQ coefficients. The resulting TEQ filter 
shortens the overall channel response such that residual 
energy of the overall channel outside the target length is 
minimized. In applications, such as DMT, TEQ filters 
obtained through the algorithm of the present invention 
provides minimum energy Inter Symbol Interference (ISI) 
and Inter Channel (or Carrier) Interference (ICI) and as a 
result yields a higher bit rate. Other advantages may also be 
recognized. 

[0372] In Discrete Multitone (DM1) systems, it may be 
desirable for the discrete time channel spread to be less than 
a prefix length to avoid ISI and ICI. In some applications, the 
spread of the channel may be greater than the prefix length. 
Hence, preprocessing of data may be implemented to 
shorten the effective channel seen by the DMT receiver. An 
approach, e.g., Time domain Equalization (TEQ), may be 
implemented to apply (sample or fractionally spaced) linear 
filtering to the sampled receive data to shorten the effective 
channel spread The present invention provides an approach 
to designing sample spaced and fractionally spaced (e.g., 
with factor 2 or other factor) Finite Impulse Response (FIR) 
TEQ filters. 
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[0373] FIG. 10a outlines a basic setup for a TEQ system, 
according to an embodiment of the present invention. Here, 
x k represents the k-tb sample of the transmitted signal with 
variance ®P X . Block 1010 with up-arrow and M corre- 
sponds to an up-sampling function with factor M, which 
may involve inserting M-l zeros between the samples of x k . 
For example, for M=2 case, i k sequence may be given by 

0, Xj, 0, x_ x , 0, 0, Xj, 0, 0, X3, 0, *- 

[0374] For example, M=l may refer to a sample spaced 
TEQ, as represented in FIG. 10b 9 and M=2 may refer to a 
fractionally spaced TEQ. 

[0375] The combined effects of the transmit filter shaping, 
the receiver filter, and the distortion caused by the transmis- 
sion channel may be modeled by a Linear Time Invariant 
(LT1) system, as illustrated by 1012, with impulse response 
denoted by h±. The combined effects of the receiver noise 
and the interferences may be modeled by the additive 
disturbance v k . 

[0376] In FIG. Ufa, block 1014 represents a TEQ filter 
having N^q filter coefficients. N j^q TEQ coefficients may 
be denoted by {w 1? w 2 , and TEQ coefficient 

vector may be denoted by w, e.g., 

Wt= ( w l w NTEoI r 

[0377] An output of the TEQ filter 1014 may be down- 
sampled where the down sampling process is represented by 
a block 1016 with down-arrow and M. The down-sampling 
operation may involve selecting 1 out of M samples of o^. 
For example, for M=2 case, z*. sequence may include 

"~0-2> °f» °2> °4» *~ 

[0378] The time domain equalization may convert the 
channel impulse response h k into an impulse response with 
N T nonzero consecutive entries. Therefore, if the nonzero 
entries of the resulting channel are modeled with the 
sequence {b^ k xjO^-N^l}}, the target channel may be 
modeled as 




b^t d&nzd + N T -\ 
0 otherwise; 



[0379] where d is the effective delay corresponding to the 
starting location of the non-zero segment of the impulse 
response. 

[0380] In DMT based DSL systems, such as G.dmt and 
G.lite, standard generally dedicates certain training 
sequences for the computation of TEQ coefficients. These 
training sequences may be periodic DMT symbols without 
prefix where the period is one symbol long, for example. 
According to one example, {p n , n x{°» N R -1}} mav 
represent the DMT symbol of length N R , where the trans- 
mitted training sequence r n represents the repeated version 
of Pn> e.g., 

r a f> m and m-mod(n, NjJ 

[0381] where mod(n, Nr) is the remainder of n/N R and 
hence, m is an integer between 0 and N R -1. 

[0382] Therefore, r n may represent a periodic extension of 
the sequence p Q , e.g., r n may be obtained through periodic 



repetition of p n where the period is N R . The periodic exten- 
sion may be expressed as 

*~> Pnh-2> Pnr-x> Po» Pi> Pz> *-» Pnr-i> Po» Pn *~~ 

[0383] An adaptive TEQ problem may involve developing 
an algorithm that processes training signal r Q and corre- 
sponding received signal y n to obtain TEQ filter coefficients 
without any a priori knowledge of channel and noise statis- 
tics. 

[0384] For DMT applications, an optimal approach may 
involve choosing {w k } such that the number of bits loaded 
per symbol may be maximized. However, this approach may 
be difficult to implement and impractical for adaptive imple- 
mentations. Other alternative schemes either have high 
computational complexity and are therefore not suitable for 
adaptive implementations or have lower complexity but low 
performance and instability in the convergence. The present 
invention provides an adaptive approach that yields 
enhanced (or near-optimal) performance. 

[0385] MLC-FAST-TEQ algorithm of the present inven- 
tion uses training data sequence {r D }, and the corresponding 
received data sequence {y n }. Exemplary special cases for 
the algorithm may include a Sample Spaced Case where 
M-l and a Fractionally Spaced Case with factor 2 where 
M«2. In the Fractionally Spaced Case, {y en } and {y on } may 
represent even and odd phases of the received sequence 
respectively, e.g., 

y«-y2« 

[0386] where y n is the over-sampled received sequence. 

[0387] In this section, various algorithm parameters, 
which may include predetermined constants independent of 
the data, may be used in the algorithm of the present 
invention. Their values may be adjusted to achieve different 
levels of performance and stability as well as other results. 

[0388] The algorithm may assume certain conditions 
about the amount of training sequence data required, e.g., 
the length of the training sequence N^. For example, the 
training sequence may involve consecutive samples of the 
received signal {y n ; n x{0, - - - , N^-l}} for the sample 
spaced case. In another example, the training sequence may 
involve consecutive samples of the sequences {y cn ; n x{0, . 
. . , N te -1}} and {y on ; n X {0, . . . , N tt -1}} for the fractionally 
spaced case. 

[0389] Parameters used in the algorithm of the present 
invention include: &r which represents target channel 
length; N^q which represents TEQ filter length; N R which 
represents symbol length; L which represents fixed tap 
location where L%{0, . . . , N^-l}; N s which represents TEQ 
algorithm search vector size Ns-Nt+Nteq; ® which rep- 
resents least square cost function forgetting factor; N D which 
represents maximum tone number for the dummy signal and 
N*x which represents number of update variables where 

(3 Sampled SpacdeCase 

4 FracnonaUySpccedCaseWahFoczor2 

[0390] Other parameters may include {C,^ k x{0, 
N D -1}} which represents frequency domain dummy signal 



US 2003/0112887 Al 



20 



Jun. 19, 2003 



values. There are N D such complex values, e.g., may be 
complex numbers. Other parameters may include {•□} 
which represents time domain dummy signal, which may be 
determined by the frequency domain dummy signal values. 
The following equation shows the relation: 



N D -l 



[0991] where the term Real part of stands for the real part 
of a complex number. The time domain dummy signal may 
be defined as the repeated version of e.g., {•^• m }, 
where m=mod(n, Nr). In other words, may represent the 
periodic extension of the signal # n . 

[0992] Other parameters may include S which represents 
the scaling of the received dummy signal; Off which repre- 
sents Internal dynamic range arrangement parameter; and 
C&which represents a priori variance constant used for the 
initialization of the matrix F. 

[0393] The MLC-FAST-TEQ algorithm of the present 
invention includes at least two steps, which may include the 
adaptive computation of the equalization delay parameter d 
and the adaptive computation of the TEQ filter coefficients 
based on the delay d calculated in the first step. 

[0994] An estimate of cross-correlation function {q(k); k 
x{0, MxN R -l}} may be computed as 



1-0 n-0 J 



[0409] ?: Column Vector (Size r^xl.); 

[0404] (8> Column Vector (Size N^l.); 
[0405] ^: Column Vector (Size N x xl.); 

[0406] A: Matrix (Size r^xNg), initialized to all 
zeros; 

[0407] D: Matrix (Size N x xN s ), initialized to all 
zeros; 

[0408] X: Column Vector (Size N s xl), initialized to 
all zeros; 

[0409] k: Column Vector (Size N s xl), initialized to 
all zeros; 

[0410] F: Matrix (Size N^N^, initialized to Cftl^, 
where I Nx stands for identity matrix of size r^xN^ 

[0411] t: Column Vector (Size N x xl), initialized to all 
zeros; 

[0412] b: Column \fector (Size N s xl), initialized to 
all zeros; 

[0413] m: Column Vector (Size N s xl); 

[0414] (5> Column Vector (Size N x xl); 

[0415] c: Column Vector (Size N s xl), initialized to 
all zeros; 

[0416] x: Scalar, 

[0417] e: Scalar; and 

[0418] n: Scalar. 

[0419] TEQ coefficient algorithm steps include the fol- 
lowing: 



[0395] where N av MN R <N te . 

[0996] Then Q) may be defined as the argument maxi- 
mizing the absolute value of the cross-correlation function: 

[0997] Therefore, (T) represents the peak point location 
for the absolute cross-correlation function. 

[0998] An equalization delay d may be selected as a 
function of (l). An appropriate choice of d may be given by: 

[0999] In other words, 



d = 



Nteq 
2 4 



Sampled SpaeedCase 
FractionaUySpocedCase 



[0400] Adaptive computation of TEQ coefficients for a 
sample spaced case includes algorithm variables such as the 
following: 

[0401] Column Vector (Size N x xl.); 
[0402] (5> Column Vector (Size N^L); 



step 1. n = 0 
step 2. @ = 



step 3. ® = 



Xff. 



'TZQ 



5tCp 4. =© 4- AX, 
step 5. A =A-kfr* T , 
step 6. f =ft*(l -k T X\, 

step 7. F = ® F, 

r 

step 8. i = F - 



step 9. F = F-i{ffr> T F\ 
step 10. t ~ C8 Ffr s 
step 11. b = k + At t 
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step 12. m = 



-continued 

b H»7EQ~*) 

tz 



Step 13. @ T = [bffTEQ bffTEQ+Lrl J, 



6 



step 14. X = 



step 15. -<V = ® + Z>X, 

step 17. D = D-k«& T , 
step 18. > = c T X % 
step 19. e - + ©^.^ 

step 20. c 
step 21. n=n + l, 
step 22. if n < then gp to step 2, else stop. 



[0420] TEQ coefficients may be given by the first 
taps of the vector c, e.g., 

[0421] Adaptive computation of TEQ coefficients for a 
fractionally spaced case with Factor 2 (e.g., M=2) may 
include variables that are identical (or similar) to that for the 
sampled spaced case, discussed above. The steps for the 
algorithm may include the following: 



step 1. n = 0, 



step 2. @ • 



step 3. <S> 



X »TZQ 



step 4. ? =<S> 
step 5. vt=*-Ar> 7 , 
step 6. T = ftr{l -k T X\ 



-continued 

step 7. F = Q> F, 



stepS. t = F 



step 9. F = F - t{rtr T F), 
step 10. f = CS/ r T 
step II. 6 = k + At, 



step 12. m = 



Step 13. ® 7 = [ t>NTEQ btfTEQ+L-X ], 
ft 

step 14. X= 6 

ft 

step 15. = ® 

step 17. D = D-t«^ r , 
step 18. x = c r X, 
step 19. * - r^-i^y + © Jt _ <f _ £ ^ | -x, 
step 20. c = c + kr, 
step 21. n = n+ 1, 
step 22. if n < then fp to step 2, eke stop. 



[0422] TEQ coefficients are given by the first Ntcq taps of 
the vector c, e.g. 

W * C 1:NTEQ 

[0423] Further to the adaptive manner of obtaining FIR 
coefficients in a TEQ filter described above, we now turn to 
a method and system for shortening the channel impulse 
response in DMT systems wherein a model is employed to 
effectively represent a hypothetical delay transmit signal in 
conjunction with a desired shortened channel impulse 
response filter to arrive at near optimal TEQ coefficients. In 
contrast to the adaptive process described above, where the 
TEQ setup involved a fractionally spaced TEQ scenario in 
which up-sampling/down-sampling factor M=2, as shown in 
FIG. 10c, the TEQ filter solution described in this section is 
based on a sample spaced TEQ setup in which M=l, as 
shown in FIG. 106. 
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[0424] In DMT systems, each received data symbol or 
packet is commonly prepended with a guard sequence called 
the cyclic prefix that is used to space the symbols so that the 
receiving system recognizes each individual packet of data. 
Each DMT symbol typically consists of N samples (where 
N is an arbitrary integer), while the cyclic prefix is a copy 
of the last v (where v is an integer less than N) samples of 
the DMT symbol. In order to prevent ISI, the value of v must 
be greater than or equal to the length of the channel impulse 
response, i.e, the channel length, commonly denoted by the 
variable h. 

[0425] Upon receipt of the symbol by the receiving 
modem, the cyclic prefix is discarded, since it includes no 
information not otherwise included within the symbol. The 
symbol is further processed, and conventional methods of 
shortening the channel impulse response are applied. 
Because the cyclic prefix does not convey any new infor- 
mation about the transmitted signal, efficiency of the system 
decreases in proportion to N/(N+v). Therefore, in order to 
maximize system efficiency, either N should be large, or v 
should be small. However, as N increases, memory require- 
ments in the communications system are also increased. 

[0426] In DMT systems, it is further desirable that the 
discrete time channel spread is less than the prefix length so 
as to avoid ISI as well as ICI, which is the convolution of 
signals on overlapping carrier channels. In applied systems, 
the spread of the channel is typically significantly greater 
than the prefix length. Therefore, preprocessing of data is 
required to shorten the effective channel seen by the DMT 
receiver, thereby reducing the occurrence of ISI and Id. The 
most common approach to this preprocessing function is to 
apply Time Domain Equalization (TEQ) to the sampled 
receive data to shorten the effective channel spread. TEQ 
refers to a process for filtering the incoming signal in such 
a way as to combine the data signals from various channels 
and reduce the cyclic prefix of the resulting signal to a 
uniform size that is as small as possible. 

[0427] FIG. 106 illustrates the basic scenario involving 
the above described TEQ problem. In particular, x k (where 
k is an integer) 1000 denotes the transmitted signal that is 
propagated along a channel and ultimately received by a 
modem. The transmitted signal includes, for instance, v+1 
input samples. The effects resulting from the transmission 
channel are modeled and represented by the h channel block 
1012, where the resultant, b^ denotes the original "long" 
channel impulse response, k is a whole number from 1 to 
N 0 where N c is the channel impulse response length of the 
channel h. The resultant signal h k is further distorted by 
additive disturbance v k (where k is a real number), which 
models the receiver noise and other signal interferences. 
This effect is represented, for purposes of illustration, by the 
combination of \ and v k at hypothetical adder 1002. The 
resultant is the combined, un filtered signal y k that represents 
the signal as input at the receiver device, for instance a 
modem. The signal y k is received at the TEQ filter 1014 for 
preprocessing as described further below. The signal y k is 
the resultant vector of the current and previous received 
samples, N w , and is passed through TEQ filter 1014, which 
operates to shorten the channel impulse response of y k and 
results in output signal vector z± 1004. The TEQ vector w T 
to be applied to the signal y k may be calculated using TEQ 
filter coefficients {wj, and vice-versa, and is combined with 



the signal y k to produce the filtered output signal 1004 
having a shortened channel impulse response. 

[0428] Initially, the transmitted signal, x k 1000, is assumed 
to be an uncorrected and identically distributed wide sense 
stationary sequence having a zero mean and a squared 
variance of x k , o x 2 . It should be understood that this assump- 
tion is not entirely accurate in that the existence of the cyclic 
prefix, which is the simple repetition of a portion of the 
transmitted samples, causes some degree of correlation 
among the samples. However, generally, the separation 
between the repeated samples is large enough to neglect the 
correlation due to the prefix. 

[0429] Accordingly, the combined effects of the transmis- 
sion filter, the receiver filter, and the distortion caused by the 
transmission channel are modeled by a linear time invariant 
system with the impulse response b k . As briefly described 
above, the receiver noise and other interferences are mod- 
eled by additive disturbance v k , which is assumed to be a 
wide sense stationary sequence having a zero mean and 
correlation coefficients represented by: 

[0430] where m is a real number and where r v is the noise 
autocorrelation sequence. 

[0431] In one manner, the TEQ filter vector w T is calcu- 
lated using a number (designated as N-j^q) of Finite Impulse 
Response (FIR) TEQ filter coefficients {wj, where the 
coefficients are defined as: 



to;* e|l ... Njzq}) 

and 



[0432] A goal of TEQ is to convert the channel impulse 
response b^ into a target impulse response with effectively 
only a certain number of nonzero entries, designated as 
Therefore, if the nonzero entries of the resulting channel are 
modeled with the sequence b^, where k is a whole number 
from 0 to Np-1, the target channel, t^ can be formulated as: 

0 otherwise, / 

[0433] where d is the effective delay corresponding to the 
starting location of the non-zero segment of the impulse 
response. 

[0434] For TEQ filtering in a DMT system, one optimal 
approach involves selecting {w k } such that the number of 
bits loaded per symbol, B, is maximized. This approach is 
equivalent to maximizing B as: 
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[0435] where T is the set of transmit tones, S t -(w) is the 
desired signal energy at tone i, I t (w) is the combined energy 
of ISI and I CI at tone i caused by the components of 
outside d^n^d+N^l, Nj(w) is the noise energy at tone I, 
and r is the effective gap which is a function of the 
constellation, coding gain, and the margin requirement. 

[0436] Unfortunately, the maximization of B with respect 
to w is a difficult, processor intensive computation, and is 
impractical for most real world, adaptive implementations. 
Therefore, some alternative near-optimal schemes have been 
developed. One such scheme includes maximizing the geo- 
metric Signal to Noise Ratio (SNRq) as: 



wi? 4 n Si{w) 



[0437] SNRq can be defined as the product of SNRs at 
each transmitted tone within the symbol. However, this 
process is also computationally complex and, consequently, 
is also unsuitable for practical applications. Alternative, 
comparatively lower complexity approaches such as that 
described in U.S. Pat. No. 5,285,474, to Chow et al. have 
been developed. However, the reduction in computational 
complexity is achieved at an unacceptable cost of significant 
performance degradation or convergence instability in most 
real world applications. 

[0438] An alternative solution to the TEQ problem 
includes minimizing wall energy with a constant constraint 
on the power of the target impulse response which leads to 
a computationally demanding Singular Value Decomposi- 
tion (SVD) based algorithm. Wall energy is defined as the 
residual impulse response energy outside of the region 
corresponding to the target impulse response. One manner 
explored for enabling such wall energy minimization 
includes utilizing a least square fit based approximation. 
Unfortunately, this solution has proven to result in unac- 
ceptable levels of performance degradation. 

[0439] An alternative solution to the above problem 
includes a conventional Minimum Mean Square Error 
(MMSE) approach, where the target impulse response (TIR) 
coefficients are assumed to be part of the desired final, 
filtered signal vector (the search vector). However, the 
squared norm constraint assumed in this approach to avoid 
an all-zeros solution leads to numerically difficult compu- 
tations, such as an eigenvector computation to obtain TEQ 
coefficients. It is especially hard to implement direct adap- 
tive training of the equalizer coefficients under this con- 
straint. 

[0440] While theoretically it may be possible to achieve 
improved performance using a significantly longer TEQ 
filter, the computation needs, power and silicon area 
required to implement such a long TEQ filter arrangement is 
not likely cost effective and largely impractical. Further, in 
practical applications, adaptive computation of TEQ filter 
coefficients is a necessity. Still further, most digital signal 
processors (DSPs) in practical applications support only 
fixed point arithmetic. As a result, finding stable adaptive 
algorithms with fixed point arithmetic to compute long TEQ 
filters may not be practical. Therefore, there is a need in the 
art of digital communication systems for a practical method 



and system for effectively and efficiently shortening the 
channel impulse response b k to a filtered channel impulse 
response z t . 

[0441] The present invention overcomes the problems 
noted above, and provides additional advantages, by pro- 
viding for a method of applying linear filtering to the 
received data set The filtering is achieved by factoring the 
transmit data (xj along a channel (hj combined with 
additive disturbance/noise consideration (vj to arrive at a 
combined unfiltered signal (v*), and in consideration of a 
hypothetical sampling of the transmit data along a delay 
channel (d) to arrive at a target impulse response. From this, 
the present invention derives a Minimum Mean Square 
Linearly Constrained Time Delay Equalization (MLC-TEQ) 
mathematical model and applies the target impulse response 
to achieve a shortened channel impulse response (z^). The 
shortened channel impulse response defined by processing 
the unfiltered received data (yj through the TEQ coeffi- 
cients and TEQ vector w T is effectively compared with the 
hypothetical target channel impulse response (tj, which is 
derived from the delay channel as applied to (b^), to deter- 
mine an error e k which is to be minimized by design. In this 
manner, the received data signal (y^) is converted into a TEQ 
filtered signal (zj designed to approximate a desired target 
channel (tj. The invention shortens the overall channel 
response in a communication system such that the residual 
energy of the overall channel outside the target length is 
minimized. In addition, Inter Symbol Interference is mini- 
mized by the present invention. 

[0442] The block diagram of FIG. 10c illustrates the 
inventive MLC-TEQ filter approach that overcomes the 
shortcomings of earlier approaches and yields the aforemen- 
tioned advantages. In particular, a transmit data signal, x k 
1000 is propagated along a channel 1012 having distortive 
effects on the signal, as represented by the h channel block 
1012, and has an effective channel impulse response repre- 
sented as h^ at 1020. To more fully represent the effects of 
various processes on the transmit signal, a disturbance 
vector v k , winch models the various receiver noises and 
interferences, at 1022 is applied to the channel impulse 
response to yield an unfiltered received data signal y k at 
1024 that is received at an input of the TEQ filter 1014. By 
way of example, the received data signal y k could represent 
the input of a modem commonly used in communication 
systems. Unfiltered signal y k is the resultant vector of the 
current and previous N w received samples, where N w is 
defined as an integer representing the number of replicate 
received samples of transmitted data and represents TEQ 
length. 

[0443] The unfiltered y k signal is passed through Time 
Domain Equalization (TEQ) filter 1014, which shortens the 
channel impulse response and generates a shortened channel 
impulse response signal 7* at 1026. As shown by the hatched 
lines, a hypothetical parallel branch of the transmitted data 
signal is shown as a delay function (d) 1032 applied to a 
determined target vector \ to yield a target channel ^ for 
modeling the shortened channel impulse response to arrive 
at the TEQ filter coefficients. In effect, the TEQ filter 1014 
shortens the channel impulse response to rninimize the mean 
square error e^ E(e k 2 ), between the received signal and the 
hypothetical delay channel, thereby approximating the 
desired target channel. 
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[0444] As showo in FIG. 10c, hypothetically, the delay 
channel 1032 generates a signal at 1034 having a Target 
Impulse Response b 1036. The delay channel 1032 output is 
acted on by TIR b vector 1036 to produce the target impulse 
response channel t n is, in a sense, compared with the TEQ 
filter output vector z± 1026 resulting in a data error signal e^ 
at 1004, which represents the kth error signal (that is, the 
error signal that corresponds to the resultant data signal). In 
determining the desired TEQ filter coefficients, e^ is mini- 
mized. 

[0445] In order to minimize the minimum mean least 
square error between the parallel communication channels 
(i.e., the impulse channel and the delay channel), it is 
necessary to calculate the TEQ filter coefficients used in 
generating the Equalizer Coefficient Vector w T such that the 
resultant vector has minimum energy Inter Symbol Interfer- 
ence (ISO and Inler Channel Interference (Id). 

[0446] The MLC-TEQ approach of the present invention 
provides a family of solutions that are parameterized by 
variables d e{0, . . . , N^N^o-Nt^} and L e{0, . . . , 
It is further assumed that the channel impulse 
response {h^ i E{0, . . . , N c -1}} and the noise autocorre- 
lation sequence {r v (i), i G Z} are known or estimated. Each 
solution has a closed form expression in terms of d, L, {h^ 
i G{0, . . . , N c -1}}, {r v (i), i <E Z}, TEQ length N TCQ and 
the target impulse response length rip. For fixed values of d 
and L, the MLC-TEQ filter determination steps are dis- 
cussed in more detail below. 

[0447] The approach embraced by the TEQ filter determi- 
nation process of the present invention poses the TEQ 
problem as one of minimizing the error differentiating the 
parallel branches of the h channel 1012 and hypothetical d 
channel 1032 of FIG. 10c to determine the desired TEQ 
coefficients {w k }, represented as: 

{w k ;ke{l...ti niQ }} 

[0448] and Target Impulse Response coefficients {bj as: 

fok £{!,...,*,.}}. 

[0449] An additional constraint on w or b is used to avoid 
the trivial all zeros solution. In the present invention, that 
constraint is preferably on b, where one of the taps of b is 
constrained to be equal to a non-zero constant as b L =c, 
where L E{1 . . . Hp}, c E9t, and c*0. C may be chosen as 
c=l without a loss of generality. This constraint allows the 
use of linear mean square minimization techniques, and the 
resulting algorithm is therefore more suitable for fast and 
efficient direct adaptive implementations. 

[0450] Due to the disturbance v k and the finite length 
constraint on b and w, the error sequence {ej has been 
found to generally not be equal to zero. Therefore, the error 
may be minimized in order to create effective filtering. In 
one manner, the mathematical method of the present inven- 
tion, targets the minimization of the mean square of error e k . 
In order to do so, the TEQ problem is preferably posed as to 
minimize E^ 2 ) for w and b such that b^-l, where: 

b = [bo ... V-if; 



-continued 

and 



[0451] is the expected value (mean) of the square error, 
which is the minimization of mean square error by choosing 
vector variables w and b under the constraint that the Lth tap 
of vector b is constrained to be equal to 1. Minimization of 
E over w and the unconstrained components of b is a linear 
mean square minimization problem and the result for TEQ 
coefficient vector w is given as: 

[0452] Referring now to the flow chart of FIG. 11, one 
embodiment of a method for determining a set of TEQ 
coefficients to be used to generate a signal vector in 
accordance with the present invention. Initially, in step 1100, 
the combination of the output of the h channel and the 
disturbance vector, y k , is sampled. In step 1102, the hypo- 
thetical delay channel d is set. The impulse response length 
of the target channel (tj is determined in step 1104. Optimal 
TEQ filter coefficients {wj are determined in step 1106, 
such that the number of bits of data loaded per symbol (B) 
is maximized while the channel impulse response is mini- 
mized to provide an output with minimal noise and height- 
ened data density. In step 1108, an equalizer coefficient 
vector w T is calculated based upon the TEQ filter coeffi- 
cients defined in step 1106. In effect, the response z± is 
compared, as in step U10, with the target impulse response 
channel (tj vector b to generate optimal TEQ filter coeffi- 
cients that result in minimizing the error of e^. By minimiz- 
ing the error between the channels, the inventive method 
arrives at a set of coefficients wk and vector w T to yield a 
shortened channel impulse response z k that approximates the 
target channel impulse response. In a one embodiment, the 
Zfc vector preferably has a length N^. and is the target impulse 
response vector. 

[0453] Referring now to the flow chart of FIG. 12, one 
embodiment of the operation of step 1106 is described. In 
step 1202 the original long channel impulse response h± is 
determined In step 1204, the noise autocorrelation 
sequence, r v , is determined. In step 1206, the delay value, d, 
is fixed as the starting location of the non-zero segment of 
the target impulse response. Similarly, a value for L is 
chosen and fixed as the center of the range of possible values 
that L can take, where L is calculated as: 




[0454] in step 1208, where Lis an integer value used to set 
the target value of the length of the desired signal Kj.. With 
h^ d and L known or fixed at certain values, the error 
between the parallel branches z± and b± may be minimized 
in step 1210. 

[0455] The specifics of step 1210 are discussed in more 
detail with reference to FIG. 13. In particular, FIG. 13 
details an embodiment of a process used to calculate the 
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TEQ filter {wj and HR filter {bjj coefficients. In order to 
minimize the error between z k and a MLC-TEQ math- 
ematical method is constructed in step 1300. In order to 
avoid the trivial all-zeros solution, a finite length constraint 
is imposed on b or w in step 1302. In the example of step 
1302, one of the received signal channels, or "taps", of b is 
constrained as b L =c, where L G{0, . . . , N^l} and c est, 
and c*0. In this example, c is chosen to be c=l; this is done 
without a loss of generality. 

[0456] A matrix F is constructed in step 1304 as: 



[0465] It should be understood that the choices of the 
parameters d and L has a certain impact upon the perfor- 
mance of the TEQ filter with respect to a chosen application. 
Relating specifically to a DMT application, a reasonable 
capacity performance is obtained by choosing d as the 
location of the maximum value of the impulse response 

[0466] Further, it has been observed that for a typical 
scenario, there is a relatively wide region around where 



Ontl-IJ °NT-L-IA Oyr-L-U tffT-t^l^T-L-U °NT-Lr- l,NC+NTEQ-NT -d- 



[0457] A covariance matrix, R^ is constructed in step 
1306 as: 



RiNTEQJfTEQ ~ 



r„(0) 



r,(0) 



rv(*7Efi-2) 



the performance of the filter does not change, where d c is the 
effective delay corresponding to the starting location of the 
non-zero segment of the impulse response. For the L param- 
eter, a reasonable choice is given by the equation: 



[0458] A matrix H is constructed in step 1308 as: 



ho hi ... 
0 ho ... 



... A* e -» 0 

... hf/ c ~2 hfi{ c -\ 



0 ho 



hp/.-i 



[0459] Further, in step 1310, a matrix h is constructed for 
d+L<N TEQ as: 

A=t^d*L • • * AoOujteq-a-i^i] 

[0460] and for d+L^N^Q in step 1312 as: 

[0461] The M^q FIR TEQ coefficients {wj are calcu- 
lated in step 1314, as are the target impulse response 
coefficients {bj in step 1316. 

[0462] Returning now to FIG. 12, once the above values 
have been calculated, the various F, R^ H and B and the 
coefficients {w k } are then used to calculate the Equalizer 
Coefficient vector w T in step 410 as: 

w T ~fiiRja x 2 +Hii-F T f)M T )- 1 
[0463] where: h is a function of the impulse response 
coefficient; R v ntecxnteo is a noise autocorrelation func- 
tion; a x 2 is the variance of x^ H is a function of the channel 
impulse response; I is a function of the energy of the ISI and 
I CI interferences; F is an intermediate variable; and 



[0464] or the change from w Q T to Wj^^^ 1 



[0467] where fx] is a function representing the smallest 
integer that is greater than or equal to the number x. 
Therefore, L is chosen as the center of the range of possible 
values that L can take. 

[0468] From the detailed description above, it will be clear 
to one skilled in the art that the present invention has many 
benefits. In particular, the fundamental advantages are two 
fold: it provides linear TEQ filtering with near-optimal 
performance, and it is structurally suitable for adaptive 
implementation on a real system. In particular, and in 
contrast to the filtering methods currently used in the art, this 
method includes: the Target Impulse Response (TIR) vector 
is the combined response of the transmission channel and 
the TEQ filter is used as the search vector, the MLC-TEQ 
mathematical method constrains one element of the TIR 
vector to be equal to a constant and the element can be any 
element of the TTR vector, the error is computed using the 
difference between the TEQ output and the TIR filter output; 
the TEQ filter coefficients are computed by minimizing the 
Mean Square Error (MSE) criterion; and the mathematical 
method provides a family of TEQ vectors which are 
parametized by two quantities, the location of the con- 
strained element of the TIR vector and the delay value used 
in the computation. 

[0469] In its Hadrian™ and Antoninus™ products, Virata 
Corporation of Santa Clara, Calif., extends the benefits of an 
integrated full-rate ADSL CPE chipset. The inventive con- 
cepts discussed above may be incorporated into an inte- 
grated ADSL CPE processor and Analog Front End/Line 
Driver (AFE/LD) chipset, such as Virata Corporation's 
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Hadrian™ and Antoninus™, which may be used in a wide 
variety of applications. According to one embodiment of the 
present invention, the inventive concepts related to echo 
cancellation functionality and TEQ functionality may be 
incorporated into the Hadrian and/or Antoninus products. 

[0470] FIGS. 14a and 146 are schematic diagrams of 
hardware architectures in which the inventive aspects of the 
present invention may be incorporated. FIG. 14a illustrates 
a block diagram of a chip architecture 1400 for Hadrian 1, 
which includes an integrated ADSL CPE processor 1402 and 
AFE/LD chipset 1440. The integrated ADSL CPE processor 
1402 may be delivered as a multi-chip module (MCM) 
including 23 chip 1412 and Domitian Chip 1410. 73 chip 
1412 is full-rate ADSL Physical Layer (PHY) for CPE with 
on-chip DP (e.g., DSL PHY Processor) code storage 
memory. Domitian chip 1410 is a high performance com- 
munications processor for CPE with Ethernet PHY. Hadrian 
AFE/LD 1440 is an AFE with integrated line driver for 
full-rate ADSL CPE, as a part of the chip-set. The Hadrian 
AFE/LD 1440 is an integral part of the complete solution. 
The Hadrian Integrated ADSL CPE Processor 1402 is com- 
patible with the Hadrian AFE/LD 1440. 

[0471] The Hadrian-1 chipset, as shown in FIG. 14a, is 
designed to be fully compliant with the full-rate ADSL ITU 
standard G.992.1 (G.dmt) and the splitterless ADSL ITU 
standard G.992.2 (G.lite). Hadrian-i fully supports Annex A, 
both Frequency Division Duplex (FDD) and Echo Cancelled 
(EC) modes, and Annex B of G.992.1 and Annex A and B 
of G.992.2. In addition, the initialization procedure is fully 
compliant with G.994.1 (Rev. 2) (G.hs.bis). Other features 
may include being fully compliant with the splitterless 
ADSL ITU standard G.992.2 (G.lite) and supporting both 
Annex A and B. Hadrian 1 may also include a communica- 
tions processor with Protocol Processor (PP) for protocol 
execution and Network Processor (NP) for data manipula- 
tion. Other features may include DSL PHY Processor (DP) 
for implementing modem firmware; Internal 10/100 Ether- 
net PHY (disabled when Media Independent Interface (Mil) 
interface is used); 10/100 Ethernet MAC with Mil for 
connecting to either internal 10/100 Ethernet PHY or exter- 
nal 10/100 multi-PHY; External Peripheral Bus (EPB); 
Synchronous Transfer Mode (STM): Serial frame data (e.g., 
Higb-Level Data link Control (HDLQ); simultaneous STM 
and ATM support; bearer channels ASO, AS1, LSO, LSI for 
both STM and ATM; single and dual latency paths for both 
downstream and upstream; four framing modes, e.g., modes 
0, 1, 2, and 3; Network Timing Reference (NTR) transport 
support; dying gasp detection and handling; Universal Serial 
Bus (USB) 1.1 slave; integrated line driver and \oltage 
Controlled Crystal Oscillator (VCXO) for the AFE; general 
purpose input/output (GPIO); universal asynchronous 
receiver/transmitter (UAM); In -Circuit Emulator (ICE); 
and/or Joint Test Action Group (JTAG). 

[0472] In particular, standard compliance includes the 
power spectral density (PSD) of the transmit signals for all 
supported annex and corresponding modes; the use of fre- 
quency bins for transmit and receive signal including pilot 
bins; handshake procedure as specified in G.994. 1; various 
initialization, training, and message signals; state machine 
and timing sequence; DMT modulation; framing modes; 
bearer channels and dual latency paths; Embedded Overhead 
Control (EOQ channel; ADSL Overhead Control (AOQ 
Channel; forward error correction; inter-leaver operation; 



and ATM and STM functionalities. Hadrian-1 is interoper- 
able with standard compliant central office (CO) equipment, 
including various DSL Access Multiplexor (DSLAM) ven- 
dors. 

[0473] FIG. 14b illustrates Hadrian 2, which includes an 
integrated ADSL CPE Processor 1452 and AFE/LD 1460 
chipset, as shown by 1450. Hadrian 2 ADSL CPE processor 
1452 may be delivered as a single-chip and cost-optimized 
device, which includes 73 chip and Domitian chip. 73 chip 
is a full-rate ADSL PHY for CPE without on-chip DP code 
storage memory. Domitian chip is a communications pro- 
cessor for CPE with Ethernet PHY and with SDRAM 
interface capable of handling various processors, such as 
Network Processor (NP), Protocol Processor (PP) and DSL 
PHY Processor (DP). Hadrian AFE/LD 1460 is an AFE with 
integrated line driver for full-rate ADSL PHY for CPE and 
will also be included as part of a chip-set. Hadrian 2 may 
support the above listed features for Hadrian 1 in addition to 
STM: TDM H. 110, ST-Bus. 

[0474] The Domitian communications processor has been 
architected around the principle that there are two functions 
of a communications system which may use different archi- 
tectures. These two functions are data manipulation and 
protocol execution. 

[0475] Data manipulation involves many word, byte and 
bit changes on small amounts of data. These operations have 
a deterministic character and are finished in a short time 
span. The code controlling these operations are small and 
compact, often written in assembly language, and involves 
a detailed knowledge of low level data protocols (Open 
Source Initiative (OSI) layer 1 to layer 3) and the available 
hardware. Data is processed in cells and packets of limited 
size. 

[0476] The protocol stack routines are multitasking, 
involving a large amount of memory and although time 
limits are set, these are not as strict as for the low level data 
manipulation routines. The data consists of frames which 
can be large (e.g., up to 64K). 

[0477] The Domitian architecture reflects these two types 
of function. The Protocol Processor (PP) has a 4 KB cache 
and is thus suited for running large programs. It has no direct 
access to the network data port but can exchange data with 
the Network Processor (NP) using shared Synchronous 
Dynamic Random-Access Memory (SDRAM). Large 
frames of data may be easily exchanged between the two 
processors using this method. The main PP tasks are to set 
up and close connections, get data from the NP, convert the 
data from one protocol type to another and pass them back 
to the NP again to be transferred to another network inter- 
face. For protocol support, it may also generate its own data 
frames for signalling, operating and maintenance. The PP 
may also communicate with, and control external hardware 
using the External Peripheral Bus (EPB). 

[0478] The Network Processor (NP) has a dedicated local 
program memory (e.g., 16 KB of SRAM) and may directly 
interact with network ports and associated hardware. Amain 
tasks involves assembling incoming packets into frames for 
each of the network connections. It knows the status of each 
network port and responds to any incoming packet/cell in 
time to prevent buffer overflow. While receiving packets/ 
cells, it assembles them into frames, keeps track of errors, 
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gathers statistical data, performs policing, extraction of 
Operation, Administration and Maintenance (OAM) cells, 
etc. When transmitting data, it segments frames into packets/ 
cells, transmits at a specified rate by inserting idle cells, adds 
Cyclic Redundancy Check (CRC) bytes, inserts OAM cells, 
etc. 

[0479] FIG. 15 is a block diagram of a physical media 
dependent (PMD) layer 1500 of a ADSL CPE chip in which 
the inventive aspects of the present invention may be 
incorporated. FIG. 15 illustrates core building blocks for the 
PMD layer and basic signal flow. As shown in FIGS. 14a 
and 14b, PMD 1430 is implemented in ADSL CPE Proces- 
sor 1402 and 1452. 

[0480] PMD layer may include a variety of functions, 
which may include a DSL PHY Processor (DP) for imple- 
menting modem firmware; a complete DMT engine includ- 
ing IFFT (e.g., 64-point, 128-point or multiples thereof) on 
the transmit path and a point Fast Fourier Transform (FFT) 
(e.g., 512-point) on the receive path, Tx/Rx Bit Extract, 
Cyclic Prefix, and Constellation encoder/decoder; a Forward 
Error Correction (FEC) mechanism including the mandatory 
Reed Solomon (RS) encoder/decoder and optional Trellis 
encoder/decoder, both fast and interleaved paths are sup- 
ported along with the Interleaver/deinterleaver; Training 
Signals Generator and Sync Symbol Generator; Course/Fine 
Gain Scaling and Tx/Rx Clip Scaling; Programmable Trans- 
mit Filter and Receive Noise Reduction Filter; Digital 
Automatic Gain Control (AGC); Adaptive Time domain 
Equalization (TEQ); Adaptive Frequency domain Equaliza- 
tion (FEQ); Tone ordering and Bit Swap; Clock Recovery 
Loop (CRL); Sigma-Delta Interpolator and Decimator for 
Digital to Analog Converter (DAQ and Analog to Digital 
Converter (ADC); and Adaptive Echo Cancellation Filter. 

[0481] As shown in FIG. 15, adaptive echo canceller 1510 
may embody the inventive aspects related to echo cancel- 
lation of the present invention discussed above. In addition, 
adaptive TEQ 1520 may embody the inventive aspects of the 
TEQ of the present invention discussed above. 

[0482] FIG. 16 is a block diagram of a transmission 
convergence (TC) layer 1600 of a ADSL CPE chip in which 
the inventive aspects of the present invention may be 
incorporated. As shown in FIGS. 14a and 146, TC 1420 is 
implemented in ADSL CPE Processor 1402 and 1452. The 
TC layer may support a variety of features, such as ATM- 
TC; STM-TC; dual-bearer; simultaneous ATM and STM 
transport; Bearer Channel Switch; UTOPIA 2 interface; 
ADSL Overhead Control/Embedded Overhead Control 
(AOC/EOC) Buffers; Tx/Rx CRC; and Scrambler/Descram- 
bler. 

[0483] FIG. 17 is a block diagram of an analog front end 
device in which the inventive aspects of the present inven- 
tion may be incorporated. Hadrian AFE/LD, as shown by 
1440 and 1460 in FIGS. 14a and 146, with integrated line 
driver is an integral part of the complete solution. To achieve 
enhanced performance the Hadrian AFE/LD and its Hadrian 
Integrated ADSL CPE Processor counterpart are designed 
with tightly coupled architecture. AFE features may include 
an integrated line driver; an integrated VCXO; 16-bit DAC 
with 14.5 effective bits; 16-bit ADC with 13.5 effective bits; 
industrial temperature range: -40C — 85C; and external fil- 
ters. 

[0484] FIGS. 18a-18c are a block diagram of applications 
in which the inventive aspects of the present invention may 



be incorporated. FIG. 18a illustrates an Ethernet Router/ 
Bridge 1800 which incorporates the Hadrian product. FIG. 
18b illustrates a USB Modem and USB-Attached Gateway 
1820 which incorporates the Hadrian product. FIG. 18c 
illustrates an Integrated Access Device (IAD) 1840 which 
incorporates the Hadrian product. 

[0485] Virata's Antoninus is a full-rate ADSL CPE PHY 
device comprised of two chips, which includes an ADSL 
CPE Processor called Z3 (as discussed above in connection 
with Hadrian 1) and a AFE/LD chipset, as illustrated in FIG. 
17. The 73 chip is a full-rate ADSL PHY for CPE with 
on-chip DP (e.g., DSL PHY processor) code storage 
memory. The ADSL CPE processor 23 is also used in 
Hadrian 1, discussed above. Similarly, the inventive aspects 
related to the echo canceller and the TEQ may be imple- 
mented in the Antoninun product Antoninus AFE/LD is an 
AFE with integrated line driver for full-rate ADSL CPE, 
which is also included as part of the chip-set. 

[0486] While the foregoing description includes many 
details and specificities, it is to be understood that these have 
been included for purposes of explanation only, and are not 
to be interpreted as limitations of the present invention. 
Many modifications to the embodiments described above 
can be made without departing from the spirit and scope of 
the invention. 

[0487] The present invention is not to be limited in scope 
by the specific embodiments described herein. Indeed, vari- 
ous modifications of the present invention, in addition to 
those described herein, will be apparent to those of ordinary 
skill in the art from the foregoing description and accom- 
panying drawings. Thus, such modifications are intended to 
fall within the scope of the following appended claims. 
Further, although the present invention has been described 
herein in the context of a particular implementation in a 
particular environment for a particular purpose, those of 
ordinary skill in the art will recognize that its usefulness is 
not limited thereto and that the present invention can be 
beneficially implemented in any number of environments for 
any number of purposes. Accordingly, the claims set forth 
below should be construed in view of the full breath and 
spirit of the present invention as disclosed herein. 

1. A weighted vector error echo canceller comprising: 

an adaptive echo canceller filter having an input adapted 
to receive a transmit signal, the adaptive echo canceller 
filter being adapted to generate a signal representative 
of an echo signal associated with .the transmit signal, 
wherein the adaptive echo canceller filter functions at a 
transmit rate; and 

an interpolation filter bank having an input adapted to 
receive the representative signal, the interpolation filter 
being adapted to generate a filtered output vector at a 
plurality of branches, wherein the filtered output vector 
is subtracted from a receive signal vector for generating 
a residual echo vector and wherein the plurality of 
branches corresponds to the receive signal vector. 

2. The echo canceller of claim 1, further comprising 

an error weighting multi-input-multi-output filter having 
an input adapted to receive the residual echo vector, the 
error weighting multi-input-multi-output filter being 
adapted to generate a weighted error vector for training 
the adaptive echo canceller filter. 
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3. The echo canceller of claim 1, wherein the adaptive 
echo canceller filter is a finite impulse response filter. 

4. The echo canceller of claim 1, further comprising a 
reference signal interpolation filter having an input adapted 
to receive a transmit signal, the reference signal interpola- 
tion filter being adapted to generate a reference signal vector. 

5. The echo canceller of claim 1, further comprising a 
vectorization unit having an input adapted to receive an 
input signal from an analog front end, the vectorization unit 
being adapted to generate a receive signal vector. 

6. The echo canceller of claim 5, wherein the vectoriza- 
tion unit further comprising at least one delay unit for 
delaying the input signal by a predetermined number of 
samples. 

7. The echo canceller of claim 5, wherein the vectoriza- 
tion unit further comprising at least one down sampling 
block for decimating the input signal by a predetermined 
factor. 

8. The echo canceller of claim 1, wherein training func- 
tionality associated with the adaptive echo canceller filter 
functions at the transmit rate. 

9. The echo canceller of claim 1, wherein updating 
functionality associated with the adaptive echo canceller 
filter functions at the transmit rate. 

10. The echo canceller of claim 8, wherein the training 
functionality involves adapting a plurality of coefficients of 
the adaptive echo canceller filter based on at least one of a 
plurality of up sampled reference signal vectors and 
weighted error signal vectors. 

11. A method for implementing a weighted vector error 
echo canceller, the method comprising the steps of: 

receiving a transmit signal; 

generating a signal representative of an echo signal asso- 
ciated with the transmit signal at a transmit rate; 

receiving the representative signal; and 

generating a filtered output vector at a plurality of 
branches, wherein the filtered output vector is sub- 
tracted from a receive signal vector for generating a 
residual echo vector and wherein the plurality of 
branches corresponds to the receive signal vector. 

12. The method of claim 11, further comprising the steps 
of: 

receiving the residual echo vector; 

generating a weighted error vector for training an adaptive 
echo canceller filter. 

13. The method of claim 11, wherein the adaptive echo 
canceller filter is a finite impulse response filter. 

14. The method of claim 11, further comprising the step 

of: 

generating a reference signal vector. 

15. The method of claim 11, further comprising the steps 

of: 

receiving an input signal from an analog front end; and 

generating a receive signal vector. 

16. The method of claim 15, further comprising the step 

of: 

delaying the input signal by a predetermined number of 
samples. 



17. The method of claim 15, further comprising the step 
of: 

decimating the input signal by a predetermined factor. 

18. The method of claim 11, wherein training function- 
ality associated with the adaptive echo canceller filter func- 
tions at the transmit rate. 

19. The method of claim 11, wherein updating function- 
ality associated with the adaptive echo canceller filter func- 
tions at the transmit rate. 

20. The method of claim 18, further comprising the step 
of: 

adapting a plurality of coefficients of the adaptive echo 
canceller filter based on at least one of a plurality of up 
sampled reference sequences and weighted error signal. 

21. The echo canceller of claim 1, wherein a least mean 
square update rule is applied to train at least one coefficient 
of the echo canceller filter, the least means square update 
rule is defined as 
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where w represents a coefficient vector, fi represents step 
size, e represents the error signal, e represents a 
weighted error vector, X represents a transmit signal 
matrix, F represents an interpolation filter bank matrix 
and H represents a Toeplitz matrix. 

22. The echo canceller of claim 1, wherein a least mean 
square update rule is applied to train at least one coefficient 
of the echo canceller filter, the least means square update 
rule is defined as 

where w represents a coefficient vector, fi represents step 
size, S represents a reference filter vector, and e rep- 
resents a weighted error vector. 

23. The method of claim 11, wherein a least mean square 
update rule is applied to train at least one coefficient of the 
echo canceller filter, the least means square update rule is 
defined as 



where w represents a coefficient vector, p represents step 
size, e represents the error signal, e represents a 
weighted error vector, X represents a transmit signal 
matrix, F represents an interpolation filter bank matrix 
and H represents a Toeplitz matrix. 
24. The method of claim 11, wherein a least mean square 
update rule is applied to train at least one coefficient of the 
echo canceller filter, the least means square update rule is 
defined as 
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where w represents a coefficient vector, fi represents step 
size, S represents a reference filter vector, and e rep- 
resents a weighted error vector. 
25. The echo canceller of claim 1, wherein a steady state 
operation is supported by passing the residual echo vector to 
a receiver. 



26. The method of claim 11, wherein a steady state 
operation is supported, further comprising the step of: 

passing the residual echo vector to a receiver. 



